modelgrep

Fastest Amazon Models

Quick answer · Updated June 2026

The fastest Amazon model is Nova 2 Lite at 119 output tokens per second. Nova Micro 1.0 (97 t/s) and Nova Lite 1.0 (76 t/s) round out the top three.

Speed: 119 t/s
Intelligence: 24.6
Input: $0.300/M
Context: 1M

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.

  1. nova-2-lite-v1
    Reasoning · Tools · Vision
    24.6 intel · $0.300/M · 544ms ttft
    Speed: 119 t/s
  2. nova-micro-v1
    Tools
    10.3 intel · $0.035/M · 322ms ttft
    Speed: 97 t/s
  3. nova-lite-v1
    Tools · Vision
    12.7 intel · $0.060/M · 482ms ttft
    Speed: 76 t/s
  4. nova-pro-v1
    Tools · Vision
    13.5 intel · $0.800/M · 965ms ttft
    Speed: 41 t/s
  5. nova-premier-v1
    Tools · Vision
    19.0 intel · $2.50/M · 13.2s ttft
    Speed: 12 t/s
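
The ranking orders models by output speed alone, but each entry also lists a time to first token (ttft), and the two interact for short responses. As a rough sketch, assuming total response time is approximately ttft plus output tokens divided by output speed (a simplification that is not stated on this page and ignores queueing, prompt length, and network time), the figures above can be combined like this. The names `MODELS`, `estimated_seconds`, and `rank_by_estimate` are hypothetical helpers for illustration only.

```python
# Figures copied from the ranking above:
# model id -> (output tokens per second at p50, time to first token in seconds).
MODELS = {
    "nova-2-lite-v1": (119, 0.544),
    "nova-micro-v1": (97, 0.322),
    "nova-lite-v1": (76, 0.482),
    "nova-pro-v1": (41, 0.965),
    "nova-premier-v1": (12, 13.2),
}


def estimated_seconds(model: str, output_tokens: int) -> float:
    """Assumed latency model: ttft + output_tokens / output speed."""
    tokens_per_second, ttft = MODELS[model]
    return ttft + output_tokens / tokens_per_second


def rank_by_estimate(output_tokens: int) -> list[str]:
    """Order model ids from lowest to highest estimated response time."""
    return sorted(MODELS, key=lambda m: estimated_seconds(m, output_tokens))


if __name__ == "__main__":
    for n in (10, 500):
        print(n, rank_by_estimate(n))
```

Under this assumption, nova-micro-v1 comes out ahead for very short outputs because of its lower ttft, while nova-2-lite-v1 leads once the response is long enough for throughput to dominate.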

Frequently asked

What is the fastest Amazon model?

The fastest Amazon model is Nova 2 Lite at 119 output tokens per second. Nova Micro 1.0 (97 t/s) and Nova Lite 1.0 (76 t/s) round out the top three.

What's a good alternative to Nova 2 Lite?

Nova Micro 1.0 (97 t/s) is the closest alternative on this metric, followed by Nova Lite 1.0 (76 t/s). See the full ranking above for the tradeoffs.

How many Amazon models are there?

modelgrep tracks 5 Amazon models with live benchmarks, speed, latency, and per-provider pricing. Nova 2 Lite leads them on intelligence. All 5 qualify for this ranking.
