modelgrep

Longest-Context Z.ai Models

Quick answer · Updated June 2026

GLM 5 Turbo has the largest context window of any Z.ai model, at 262K tokens. GLM 5.1 and GLM 5 round out the top three at 203K each, the same window as every other model in this ranking.

Context: 262K
Intelligence: 46.8
Speed: 29 t/s
Input price: $1.20/M

Z.ai models with the largest context windows, ranked by token capacity. Larger windows are better suited to long documents, codebases and extended conversations.

  1. glm-5-turbo
    Reasoning · Tools · JSON
    46.8 intel · $1.20/M · 29 t/s
    262K context
  2. glm-5.1
    Reasoning · Tools · JSON
    43.8 intel · $0.980/M · 84 t/s
    203K context
  3. glm-5
    Reasoning · Tools · JSON
    40.6 intel · $0.600/M · 108 t/s
    203K context
  4. glm-4.7-flash
    Reasoning · Tools · JSON
    30.1 intel · $0.060/M · 35 t/s
    203K context
  5. glm-4.7
    Reasoning · Tools · JSON
    42.1 intel · $0.400/M · 452 t/s
    203K context
  6. glm-4.6
    Reasoning · Tools · JSON
    30.2 intel · $0.430/M · 37 t/s
    203K context
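
As a rough illustration of how the ranking can be used, the sketch below hard-codes the six entries listed above, sorts them by context window, filters for models that can hold a prompt of a given size, and estimates the input cost of filling a window completely. The `Model` class and the function names are hypothetical, not part of any modelgrep or Z.ai API. It also assumes that "K" means 1,000 tokens and that the listed $/M figure is the input price per million tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    context_k: int          # context window in thousands of tokens, as listed
    intelligence: float     # intelligence score, as listed
    input_usd_per_m: float  # assumed: input price in USD per million tokens
    speed_tps: float        # speed in tokens per second, as listed


# Figures copied from the ranking above.
MODELS = [
    Model("glm-5-turbo", 262, 46.8, 1.20, 29),
    Model("glm-5.1", 203, 43.8, 0.98, 84),
    Model("glm-5", 203, 40.6, 0.60, 108),
    Model("glm-4.7-flash", 203, 30.1, 0.06, 35),
    Model("glm-4.7", 203, 42.1, 0.40, 452),
    Model("glm-4.6", 203, 30.2, 0.43, 37),
]


def rank_by_context(models):
    """Sort by context window, largest first.

    sorted() is stable, so models tied on context keep their input order.
    """
    return sorted(models, key=lambda m: m.context_k, reverse=True)


def models_fitting(models, prompt_tokens):
    """Return the models whose window can hold a prompt of this size."""
    # Assumption: 1K = 1,000 tokens.
    return [m for m in models if m.context_k * 1000 >= prompt_tokens]


def full_window_cost_usd(model):
    """Estimate the input cost of one prompt that fills the whole window."""
    tokens = model.context_k * 1000
    return tokens / 1_000_000 * model.input_usd_per_m


if __name__ == "__main__":
    for m in rank_by_context(MODELS):
        print(f"{m.name}: {m.context_k}K, ~${full_window_cost_usd(m):.4f} per full window")
```

Because five of the six models tie at 203K, context size alone only separates GLM 5 Turbo from the rest. For prompts under 203K tokens, the choice comes down to the other columns: intelligence, price and speed.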

Frequently asked

Which Z.ai model has the largest context window?

GLM 5 Turbo has the largest context window of any Z.ai model, at 262K tokens. GLM 5.1 and GLM 5 round out the top three at 203K each, the same window as every other model in this ranking.

What's a good alternative to GLM 5 Turbo?

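GLM 5.1 (203K) is listed as the closest alternative on this metric, followed by GLM 5 (203K), although all five remaining models share the same 203K window. See the full ranking above for the tradeoffs in intelligence, price and speed.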

How many Z.ai models are there?

modelgrep tracks 10 Z.ai models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by GLM 5 Turbo. Six of them qualify for this ranking.
