modelgrep

Longest-Context Google Models

Quick answer · Updated June 2026

Gemini 3.1 Pro Preview Custom Tools has the largest context window of any Google model, at 1.0M tokens. Gemini 3.5 Flash (1.0M) and Gemini 3.1 Flash Lite (1.0M) round out the top three.

1.0MContext
70 t/sSpeed
$2.00Input /M

AI models with the largest context windows, ranked by token capacity. The best large language models for long documents, codebases and extended conversations.

  1. 1G
    gemini-3.1-pro-preview-customtools
    ReasoningToolsJSON+2$2.00/M · 70 t/s · 3.4s ttft
    1.0M
    Context
  2. 2G
    gemini-3.5-flash
    ReasoningToolsJSON+243.3 intel · $1.50/M · 148 t/s
    1.0M
    Context
  3. 3G
    gemini-3.1-flash-lite
    ReasoningToolsJSON+2$0.250/M · 105 t/s · 667ms ttft
    1.0M
    Context
  4. 4G
    lyria-3-pro-preview
    JSONVisionFree/M · 7 t/s · 6.9s ttft
    1.0M
    Context
  5. 5G
    lyria-3-clip-preview
    JSONVisionFree/M
    1.0M
    Context
  6. 6G
    gemini-3.1-flash-lite-preview
    ReasoningToolsJSON+233.5 intel · $0.250/M · 108 t/s
    1.0M
    Context
  7. 7G
    gemini-3.1-pro-preview
    ReasoningToolsJSON+241.3 intel · $2.00/M · 85 t/s
    1.0M
    Context
  8. 8G
    gemini-3-flash-preview
    ReasoningToolsJSON+246.4 intel · $0.500/M · 68 t/s
    1.0M
    Context
  9. 9G
    gemini-2.5-flash-lite-preview-09-2025
    ReasoningToolsJSON+219.4 intel · $0.100/M · 178 t/s
    1.0M
    Context
  10. 10G
    gemini-2.5-flash-lite
    ReasoningToolsJSON+217.6 intel · $0.100/M · 103 t/s
    1.0M
    Context
  11. 11G
    gemini-2.5-flash
    ReasoningToolsJSON+2$0.300/M · 79 t/s · 621ms ttft
    1.0M
    Context
  12. 12G
    gemini-2.5-pro
    ReasoningToolsJSON+234.6 intel · $1.25/M · 99 t/s
    1.0M
    Context
  13. 13G
    gemini-2.5-pro-preview
    ReasoningToolsJSON+2$1.25/M · 99 t/s · 1.1s ttft
    1.0M
    Context
  14. 14G
    gemini-2.5-pro-preview-05-06
    ReasoningToolsJSON+2$1.25/M · 99 t/s · 1.1s ttft
    1.0M
    Context
  15. 15G
    gemma-4-26b-a4b-it:free
    ReasoningToolsJSON+131.2 intel · Free/M · 40 t/s
    262K
    Context
  16. 16G
    gemma-4-26b-a4b-it
    ReasoningToolsJSON+131.2 intel · $0.060/M · 40 t/s
    262K
    Context
  17. 17G
    gemma-4-31b-it:free
    ReasoningToolsJSON+139.2 intel · Free/M · 61 t/s
    262K
    Context
  18. 18G
    gemma-4-31b-it
    ReasoningToolsJSON+139.2 intel · $0.120/M · 61 t/s
    262K
    Context

Frequently asked

Which Google model has the largest context window?

Gemini 3.1 Pro Preview Custom Tools has the largest context window of any Google model, at 1.0M tokens. Gemini 3.5 Flash (1.0M) and Gemini 3.1 Flash Lite (1.0M) round out the top three.

What's a good alternative to Gemini 3.1 Pro Preview Custom Tools?

Gemini 3.5 Flash (1.0M) is the closest alternative on this metric, followed by Gemini 3.1 Flash Lite (1.0M). See the full ranking above for the tradeoffs.

How many Google models are there?

modelgrep tracks 26 Google models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Gemini 3 Flash Preview. 18 of them qualify for this ranking.

More Google rankings

All rankings