Gemini 3.1 Pro Preview Custom Tools has the largest context window of any Google model, at 1.0M tokens. Gemini 3.5 Flash (1.0M) and Gemini 3.1 Flash Lite (1.0M) round out the top three.
AI models with the largest context windows, ranked by token capacity. The best large language models for long documents, codebases and extended conversations.
Gemini 3.1 Pro Preview Custom Tools has the largest context window of any Google model, at 1.0M tokens. Gemini 3.5 Flash (1.0M) and Gemini 3.1 Flash Lite (1.0M) round out the top three.
Gemini 3.5 Flash (1.0M) is the closest alternative on this metric, followed by Gemini 3.1 Flash Lite (1.0M). See the full ranking above for the tradeoffs.
modelgrep tracks 26 Google models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Gemini 3 Flash Preview. 18 of them qualify for this ranking.