Longest-Context Google Models

Updated July 2026

Gemini 3.6 Flash has the largest context window of any Google model, at 1.0M tokens. Gemini 3.6 Flash (batch) (1.0M) and Gemini 3.5 Flash Lite (1.0M) round out the top three.

1.0M Context

50.1 Intelligence

$1.50 Input /M

AI models with the largest context windows, ranked by token capacity: the best large language models for long documents, codebases and extended conversations.

Frequently asked

Which Google model has the largest context window?

Gemini 3.6 Flash has the largest context window of any Google model, at 1.0M tokens. Gemini 3.6 Flash (batch) (1.0M) and Gemini 3.5 Flash Lite (1.0M) round out the top three.

What's a good alternative to Gemini 3.6 Flash?

Gemini 3.6 Flash (batch) (1.0M) is the closest alternative on this metric, followed by Gemini 3.5 Flash Lite (1.0M). See the full ranking above for the tradeoffs.

How many Google models are there?

modelgrep tracks 39 Google models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Gemini 3.5 Flash. 25 of them qualify for this ranking.
