Google: Gemini 3.1 Flash Lite

google/gemini-3.1-flash-lite

105th smartest of 182ReasoningToolsJSONVisionAudio

Use via OpenRouter ↗

Intelligence

25.0

105th of 182

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.250

110th cheapest

Context

1.0M

66K max out

How it compares

Smarter than42%

of all ranked models

Cheaper than67%

of all ranked models

Overview

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Benchmarks

independent · Artificial Analysis & Design Arena

Artificial Analysis68th percentile

Intelligence Index

25.0

Coding Index

34.7

GPQA Diamond

82%

Humanity's Last Exam

16%

SciCode

42%

Tau²-Bench (agentic)

31%

Providers & pricing (6)

Provider	In $/M	Out $/M	Context	Uptime
Googlecache	$0.250	$1.50	1.0M	98.2%
Googlecache	$0.125	$0.750	1.0M	98.2%
Googlecache	$0.450	$2.70	1.0M	98.2%
Google AI Studiocache	$0.250	$1.50	1.0M	98.8%
Google AI Studiocache	$0.125	$0.750	1.0M	98.8%
Google AI Studiocache	$0.450	$2.70	1.0M	98.8%

Specifications

Context window1.0M

Max output66K

Knowledge cutoff—

Input modalitiestext, image, video, file, audio

Output modalitiestext

Prompt cachingSupported

Cache read price$0.025/M

ModeratedNo

Gemini 3.1 Flash Lite FAQ

How much does Gemini 3.1 Flash Lite cost?

Gemini 3.1 Flash Lite costs $0.250 per million input tokens and $1.50 per million output tokens via OpenRouter, making it 110th cheapest of 332 paid models.

How smart is Gemini 3.1 Flash Lite?

Gemini 3.1 Flash Lite scores 25.0 on the Artificial Analysis Intelligence Index, ranking 105th of 182 benchmarked models, with a GPQA Diamond score of 82%.