modelgrep
G

Google: Gemini 2.5 Flash

google/gemini-2.5-flash

ReasoningToolsJSONVisionAudio
Use via OpenRouter ↗
Intelligence
Design Elo
1171
Data Viz
Speed
74
99th fastest
Latency
618ms
first token
Input price
$0.300
137th cheapest
Context
1.0M
66K max out

How it compares

Faster than68%
of all ranked models
Cheaper than54%
of all ranked models

Overview

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Benchmarks

independent · via OpenRouter
Artificial Analysis
GPQA Diamond
59%
Humanity's Last Exam
5%
SciCode
23%
Design Arena · Elo9,097 tournaments
Data Viz
1171
codecategories
1156
3D
1152
Game Dev
1135
Website
1119
svg
1077
UI Component
1061

Providers & pricing (4)

ProviderIn $/MOut $/MUptime
Google$0.300$2.5096.8%
Google$0.300$2.5095.6%
Google AI Studio$0.300$2.5099.9%
Google$0.300$2.5085.8%

Specifications

Context window1.0M
Max output66K
Knowledge cutoffJan 2025
Input modalitiesfile, image, text, audio, video
Output modalitiestext
Prompt caching
Cache read price$0.030/M
ModeratedNo

Gemini 2.5 Flash FAQ

How much does Gemini 2.5 Flash cost?

Gemini 2.5 Flash costs $0.300 per million input tokens and $2.50 per million output tokens via OpenRouter, making it 137th cheapest of 298 paid models.

How fast is Gemini 2.5 Flash?

Gemini 2.5 Flash generates around 74 tokens per second with 618ms time-to-first-token (p50), the 99th fastest tracked model.

What is Gemini 2.5 Flash's context window?

Gemini 2.5 Flash supports a 1.0M-token context window and can output up to 66K tokens. It accepts file, image, text, audio, video input.

Compare head-to-head