Google: Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

169th smartest of 182Cheaper than 84% of paidReasoningToolsJSONVisionAudio

Use via OpenRouter ↗

Intelligence

6.9

169th of 182

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.100

54th cheapest

Context

1.0M

66K max out

How it compares

Smarter than7%

of all ranked models

Cheaper than84%

of all ranked models

Overview

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Benchmarks

independent · Artificial Analysis & Design Arena

Artificial Analysis22th percentile

Intelligence Index

6.9

GPQA Diamond

47%

Humanity's Last Exam

SciCode

18%

Tau²-Bench (agentic)

19%

Providers & pricing (5)

Provider	In $/M	Out $/M	Context	Uptime
Google	$0.100	$0.400	1.0M	99.9%
Google	$0.100	$0.400	1.0M	99.9%
Google AI Studio	$0.100	$0.400	1.0M	99.6%
Google AI Studio	$0.050	$0.200	1.0M	99.6%
Google AI Studio	$0.180	$0.720	1.0M	99.6%

Specifications

Context window1.0M

Max output66K

Knowledge cutoffJan 2025

Input modalitiestext, image, file, audio, video

Output modalitiestext

Prompt caching—

Cache read price$0.010/M

ModeratedNo

Gemini 2.5 Flash Lite FAQ

How much does Gemini 2.5 Flash Lite cost?

Gemini 2.5 Flash Lite costs $0.100 per million input tokens and $0.400 per million output tokens via OpenRouter, making it 54th cheapest of 332 paid models.

How smart is Gemini 2.5 Flash Lite?

Gemini 2.5 Flash Lite scores 6.9 on the Artificial Analysis Intelligence Index, ranking 169th of 182 benchmarked models, with a GPQA Diamond score of 47%.