google/gemini-3.1-flash-lite
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Google (cache) | $0.250 | $1.50 | 1.0M | 96.8% |
| Google AI Studio (cache) | $0.250 | $1.50 | 1.0M | 99.5% |
Gemini 3.1 Flash Lite costs $0.250 per million input tokens and $1.50 per million output tokens via OpenRouter, making it 107th cheapest of 298 paid models.
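As a rough illustration of what those list prices mean per request, the sketch below multiplies token counts by the two rates quoted above. The function name and the example token counts are hypothetical, and the calculation ignores any cache discounts or other billing adjustments, so treat it as an estimate only.

```python
# List prices quoted above, in USD per million tokens.
INPUT_USD_PER_M = 0.250
OUTPUT_USD_PER_M = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost from list prices (no cache discount applied)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical request: 200K input tokens and 4K output tokens.
print(f"${estimate_cost_usd(200_000, 4_000):.4f}")  # prints $0.0560
```

Output tokens cost six times as much as input tokens at these rates, so long generations tend to dominate the bill even when prompts are large.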
Gemini 3.1 Flash Lite generates around 104 tokens per second with a median (p50) time-to-first-token of 623 ms, making it the 55th fastest tracked model.
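Those two figures can be combined into a back-of-the-envelope estimate of how long a streamed response takes: the time to the first token plus the output length divided by throughput. This is a minimal sketch that assumes throughput stays constant over the whole response; the function name is hypothetical, and real latency varies with provider, load, and prompt size.

```python
# Figures quoted above: median time-to-first-token and throughput.
TTFT_SECONDS = 0.623
TOKENS_PER_SECOND = 104.0


def estimate_latency_seconds(output_tokens: int) -> float:
    """Approximate end-to-end time for a response of the given length."""
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Hypothetical response of 1,040 output tokens.
print(f"{estimate_latency_seconds(1_040):.1f} s")  # prints 10.6 s
```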
Gemini 3.1 Flash Lite supports a 1.0M-token context window and can output up to 66K tokens. It accepts text, image, video, file, and audio input.