modelgrep
G

Google: Gemini 3.1 Flash Lite Preview

google/gemini-3.1-flash-lite-preview

64th smartest of 178ReasoningToolsJSONVisionAudio
Use via OpenRouter ↗
Intelligence
33.5
64th of 178
Design Elo
1211
asciiart
Speed
96
71st fastest
Latency
624ms
first token
Input price
$0.250
111th cheapest
Context
1.0M
66K max out

How it compares

Smarter than64%
of all ranked models
Faster than76%
of all ranked models
Cheaper than63%
of all ranked models

Overview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

Benchmarks

independent · via OpenRouter
Artificial Analysis61th percentile
Intelligence Index
33.5
Coding Index
30.1
Agentic Index
25.7
GPQA Diamond
82%
Humanity's Last Exam
16%
SciCode
42%
Tau²-Bench (agentic)
31%
Design Arena · Elo12,898 tournaments
asciiart
1211
3D
1129
UI Component
1128
Website
1126
codecategories
1123
svg
1109
Game Dev
1100
Data Viz
1090

Providers & pricing (2)

ProviderIn $/MOut $/MUptime
Google AI Studiocache$0.250$1.5098.3%
Googlecache$0.250$1.5097.8%

Specifications

Context window1.0M
Max output66K
Knowledge cutoff
Input modalitiestext, image, video, file, audio
Output modalitiestext
Prompt cachingSupported
Cache read price$0.025/M
ModeratedNo

Gemini 3.1 Flash Lite Preview FAQ

How much does Gemini 3.1 Flash Lite Preview cost?

Gemini 3.1 Flash Lite Preview costs $0.250 per million input tokens and $1.50 per million output tokens via OpenRouter, making it 111th cheapest of 298 paid models.

How smart is Gemini 3.1 Flash Lite Preview?

Gemini 3.1 Flash Lite Preview scores 33.5 on the Artificial Analysis Intelligence Index, ranking 64th of 178 benchmarked models, with a GPQA Diamond score of 82%.

How fast is Gemini 3.1 Flash Lite Preview?

Gemini 3.1 Flash Lite Preview generates around 96 tokens per second with 624ms time-to-first-token (p50), the 71st fastest tracked model.

What is Gemini 3.1 Flash Lite Preview's context window?

Gemini 3.1 Flash Lite Preview supports a 1.0M-token context window and can output up to 66K tokens. It accepts text, image, video, file, audio input.

Compare head-to-head