Google: Gemini 2.5 Flash Lite Preview 09-2025

google/gemini-2.5-flash-lite-preview-09-2025

121st smartest of 178 · Cheaper than 82% of paid · Reasoning · Tools · JSON · Vision · Audio

Intelligence

19.4

121st of 178

Design Elo

1144

Website

Speed

217

15th fastest

Latency

385ms

first token

Input price

$0.100

54th cheapest

Context

1.0M

66K max out

How it compares

Smarter than 32% of all ranked models

Faster than 95% of all ranked models

Cheaper than 82% of all ranked models

Overview

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Benchmarks

independent · via OpenRouter

Artificial Analysis · 32nd percentile

Intelligence Index

19.4

Coding Index

14.5

Agentic Index

10.1

GPQA Diamond

65%

Humanity's Last Exam

SciCode

28%

Tau²-Bench (agentic)

30%

Design Arena · Elo · 3,139 tournaments

Website

1144

Data Viz

1134

Code categories

1133

Game Dev

1116

UI Component

1078

1048

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Google	$0.100	$0.400	1.0M	99.6%

Specifications

Context window: 1.0M

Max output: 66K

Knowledge cutoff: Jan 2025

Input modalities: text, image, file, audio, video

Output modalities: text

Prompt caching—

Cache read price: $0.010/M

Moderated: No

Gemini 2.5 Flash Lite Preview 09-2025 FAQ

How much does Gemini 2.5 Flash Lite Preview 09-2025 cost?

Gemini 2.5 Flash Lite Preview 09-2025 costs $0.100 per million input tokens and $0.400 per million output tokens via OpenRouter, making it the 54th cheapest of 298 paid models.
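
As a rough illustration of what these prices mean per request, the sketch below multiplies token counts by the listed rates. The function name and the `cached_tokens` parameter are hypothetical, and billing cached input at the listed $0.010/M cache read price is an assumption, since this page does not state how caching is applied.

```python
# Prices quoted on this page, in USD per million tokens.
INPUT_PRICE_PER_M = 0.100
OUTPUT_PRICE_PER_M = 0.400
CACHE_READ_PRICE_PER_M = 0.010  # assumption: applies to cached input tokens


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    cached_tokens is the portion of input_tokens assumed to be billed
    at the cache read price instead of the normal input price.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PRICE_PER_M
        + cached_tokens * CACHE_READ_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total / 1_000_000


# Example: a 200K-token prompt with a 5K-token reply.
print(f"${estimate_cost(200_000, 5_000):.4f}")  # prints $0.0220
```

Actual invoices may differ, for example if reasoning tokens are billed as output or if prices change.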

How smart is Gemini 2.5 Flash Lite Preview 09-2025?

Gemini 2.5 Flash Lite Preview 09-2025 scores 19.4 on the Artificial Analysis Intelligence Index, ranking 121st of 178 benchmarked models, with a GPQA Diamond score of 65%.

How fast is Gemini 2.5 Flash Lite Preview 09-2025?

Gemini 2.5 Flash Lite Preview 09-2025 generates around 217 tokens per second with a 385ms time-to-first-token (p50), making it the 15th fastest tracked model.
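
To put those two figures together, a back-of-the-envelope estimate of total response time is the time-to-first-token plus the output length divided by throughput. The sketch below assumes the median figures quoted here hold steady for the whole response, which real traffic will not guarantee; the function name is hypothetical.

```python
# Median figures quoted on this page.
TOKENS_PER_SECOND = 217.0
FIRST_TOKEN_SECONDS = 0.385


def estimate_seconds(output_tokens: int) -> float:
    """Rough wall-clock time for a streamed reply (hypothetical helper).

    Assumes constant throughput after the first token arrives.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return FIRST_TOKEN_SECONDS + output_tokens / TOKENS_PER_SECOND


# Example: a 1,000-token reply.
print(f"{estimate_seconds(1_000):.2f} s")  # prints 4.99 s
```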

What is Gemini 2.5 Flash Lite Preview 09-2025's context window?

Gemini 2.5 Flash Lite Preview 09-2025 supports a 1.0M-token context window and can output up to 66K tokens. It accepts text, image, file, audio, and video input.
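
A simple pre-flight check against these limits might look like the sketch below. It treats the rounded figures on this page (1.0M and 66K) as exact token limits and assumes the prompt and the requested output share the same context window; both are assumptions, and the function name is hypothetical.

```python
# Rounded limits from this page, treated as exact for illustration.
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 66_000


def fits_in_context(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the listed limits (hypothetical helper).

    Assumes prompt and output tokens count against one shared window.
    """
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


print(fits_in_context(900_000, 50_000))  # True
print(fits_in_context(990_000, 50_000))  # False: prompt plus output exceeds the window
print(fits_in_context(10, 70_000))       # False: output exceeds the 66K cap
```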

qwen/qwen3-vl-30b-a3b-thinking

Intelligence 19.7 · $0.130/M

amazon/nova-premier-v1

Intelligence 19.0 · $2.50/M

qwen/qwen3-235b-a22b

Intelligence 19.8 · $0.455/M

mistralai/mistral-medium-3

Intelligence 18.8 · $0.400/M

deepseek/deepseek-r1

Intelligence 18.8 · $0.700/M

qwen/qwen3-coder-30b-a3b-instruct

Intelligence 20.0 · $0.070/M