Lowest-Latency Amazon Models

Quick answer · Updated June 2026

Nova Micro 1.0 has the lowest latency of any Amazon model, responding in about 322ms to first token. Nova Lite 1.0 (482ms) and Nova 2 Lite (544ms) round out the top three.

322msLatency

10.3Intelligence

97 t/sSpeed

$0.035Input /M

128KContext

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

Frequently asked

Which Amazon model has the lowest latency?

Nova Micro 1.0 has the lowest latency of any Amazon model, responding in about 322ms to first token. Nova Lite 1.0 (482ms) and Nova 2 Lite (544ms) round out the top three.

What's a good alternative to Nova Micro 1.0?

Nova Lite 1.0 (482ms) is the closest alternative on this metric, followed by Nova 2 Lite (544ms). See the full ranking above for the tradeoffs.

How many Amazon models are there?

modelgrep tracks 5 Amazon models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Nova 2 Lite. 5 of them qualify for this ranking.

More Amazon rankings

Amazon: Smartest LLMs Amazon: Best LLMs for Coding Amazon: Best LLMs for Design & Frontend Amazon: Fastest LLMs Amazon: Cheapest LLMs Amazon: Best Free LLMs Amazon: Best Reasoning LLMs Amazon: Best Vision LLMs Amazon: Best LLMs for Agents Amazon: Best Open-Source LLMs Amazon: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Fastest LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs