google/gemini-2.5-flash-lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| | $0.100 | $0.400 | 1.0M | 99.7% |
| | $0.100 | $0.400 | 1.0M | 99.5% |
| Google AI Studio | $0.100 | $0.400 | 1.0M | 99.8% |
Gemini 2.5 Flash-Lite costs $0.100 per million input tokens and $0.400 per million output tokens via OpenRouter, making it the 57th cheapest of 298 paid models.
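As a rough illustration of what these per-million-token rates mean for a single request, the sketch below estimates the cost from token counts. The function name `estimate_cost_usd` and the constant names are hypothetical, and the calculation assumes simple linear pricing with no caching discounts, tiering, or extra fees, which the listing does not describe.

```python
# Prices taken from the listing, in US dollars per million tokens.
INPUT_PRICE_PER_M = 0.100
OUTPUT_PRICE_PER_M = 0.400


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars.

    Assumes plain linear pricing: tokens * (price per million) / 1,000,000.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 5,000-token reply.
    print(f"${estimate_cost_usd(200_000, 5_000):.4f}")
```

Under these assumptions, a 200,000-token prompt with a 5,000-token reply comes to about $0.022, of which $0.020 is input and $0.002 is output.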
Gemini 2.5 Flash-Lite scores 17.6 on the Artificial Analysis Intelligence Index, ranking 130th of 181 benchmarked models, with a GPQA Diamond score of 63%.
Gemini 2.5 Flash-Lite supports a 1.0M-token context window and can output up to 66K tokens. It accepts text, image, file, audio, and video input.
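A quick pre-flight check against these limits might look like the sketch below. The figures are the rounded values shown in the listing, so the exact limits may differ slightly. The function name `fits_limits` is hypothetical. The listing does not say whether output tokens count toward the context window, so the check takes the conservative reading and requires the prompt plus the requested output to fit inside it.

```python
# Rounded limits as shown in the listing; exact values may differ.
CONTEXT_WINDOW_TOKENS = 1_000_000  # listed as "1.0M"
MAX_OUTPUT_TOKENS = 66_000         # listed as "66K"


def fits_limits(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the listed limits.

    Conservative assumption: prompt and requested output must together
    fit inside the context window.
    """
    if input_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return input_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(fits_limits(900_000, 50_000))  # within both limits
    print(fits_limits(990_000, 50_000))  # prompt plus output exceeds the window
    print(fits_limits(10, 70_000))       # requested output exceeds the output cap
```

Token counts depend on the tokenizer, so a check like this is only as accurate as the count passed in.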