google/gemini-2.5-flash-lite-preview-09-2025
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
| In $/M | Out $/M | Context | Uptime |
|---|---|---|---|
| $0.100 | $0.400 | 1.0M | 99.6% |
Gemini 2.5 Flash Lite Preview 09-2025 costs $0.100 per million input tokens and $0.400 per million output tokens via OpenRouter, making it the 54th cheapest of 298 paid models.
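As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed prices. The function name `estimate_cost_usd` is hypothetical, and the calculation assumes billing is strictly linear in tokens at the listed rates, which may not hold if reasoning tokens, caching, or multimodal inputs are priced differently.

```python
# Listed prices from the table above, in USD per million tokens.
INPUT_USD_PER_M = 0.100
OUTPUT_USD_PER_M = 0.400


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming purely linear per-token billing."""
    input_cost = input_tokens * INPUT_USD_PER_M
    output_cost = output_tokens * OUTPUT_USD_PER_M
    return (input_cost + output_cost) / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a 20,000-token prompt with a 1,000-token reply.
    print(f"${estimate_cost_usd(20_000, 1_000):.4f}")
```

Under these assumptions, a 20,000-token prompt with a 1,000-token reply comes to roughly $0.0024, and output tokens cost four times as much as input tokens.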
Gemini 2.5 Flash Lite Preview 09-2025 scores 19.4 on the Artificial Analysis Intelligence Index, ranking 121st of 178 benchmarked models, with a GPQA Diamond score of 65%.
Gemini 2.5 Flash Lite Preview 09-2025 generates around 217 tokens per second with a 385 ms time-to-first-token (p50), making it the 15th fastest tracked model.
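These two figures can be combined into a back-of-the-envelope response time: time-to-first-token plus output length divided by throughput. The sketch below assumes the median figures apply to a given request and that throughput stays constant during generation. It ignores network overhead and any hidden reasoning tokens, and the name `estimate_response_seconds` is hypothetical.

```python
# Median figures quoted above; actual values vary by provider, load, and prompt size.
TTFT_SECONDS = 0.385      # time-to-first-token (p50)
TOKENS_PER_SECOND = 217   # approximate generation throughput


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate wall-clock time to stream a full response of the given length."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Hypothetical response lengths, in tokens.
    for n in (100, 1_000, 10_000):
        print(f"{n:>6} tokens: ~{estimate_response_seconds(n):.1f} s")
```

By this estimate, a 1,000-token response takes about 5 seconds, with generation time rather than time-to-first-token making up most of it.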
Gemini 2.5 Flash Lite Preview 09-2025 supports a 1.0M-token context window and can output up to 66K tokens. It accepts text, image, file, audio, and video input.