modelgrep

Cheapest Z.ai Models

Quick answer · Updated June 2026

The cheapest Z.ai model is GLM 4.7 Flash at $0.060 per million input tokens. GLM 4.5 Air ($0.125) and GLM 4.6V ($0.300) round out the top three.

$0.060 Input /M
30.1 Intelligence
35 t/s Speed
203K Context

AI models ranked by input token price. The most affordable large language model APIs, from budget open-weight models to discounted frontier models.

  1. glm-4.7-flash
    Reasoning · Tools · JSON
    30.1 intel · 35 t/s · 308ms ttft
    $0.060 Input /M
  2. glm-4.5-air
    Reasoning · Tools · JSON
    23.2 intel · 59 t/s · 378ms ttft
    $0.125 Input /M
  3. glm-4.6v
    Reasoning · Tools · JSON · +1
    23.4 intel · 60 t/s · 1.5s ttft
    $0.300 Input /M
  4. glm-4.7
    Reasoning · Tools · JSON
    42.1 intel · 452 t/s · 657ms ttft
    $0.400 Input /M
  5. glm-4.6
    Reasoning · Tools · JSON
    30.2 intel · 37 t/s · 548ms ttft
    $0.430 Input /M
  6. glm-5
    Reasoning · Tools · JSON
    40.6 intel · 108 t/s · 310ms ttft
    $0.600 Input /M
  7. glm-4.5v
    Reasoning · Tools · JSON · +1
    15.1 intel · 19 t/s · 1.5s ttft
    $0.600 Input /M
  8. glm-4.5
    Reasoning · Tools · JSON
    26.4 intel · 42 t/s · 1.4s ttft
    $0.600 Input /M
  9. glm-5.1
    Reasoning · Tools · JSON
    43.8 intel · 84 t/s · 475ms ttft
    $0.980 Input /M
  10. glm-5-turbo
    Reasoning · Tools · JSON
    46.8 intel · 29 t/s · 1.9s ttft
    $1.20 Input /M
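
To turn a per-million price into a concrete bill, multiply it by the token count and divide by one million. The sketch below is illustrative only: the prices are copied from the ranking, while the function name `input_cost_usd` and the 250M-token workload are assumptions made for the example. It covers input tokens only, since output pricing is not listed on this page.

```python
# Input prices in USD per million tokens, copied from the ranking above.
INPUT_PRICE_PER_M = {
    "glm-4.7-flash": 0.060,
    "glm-4.5-air": 0.125,
    "glm-4.6v": 0.300,
    "glm-4.7": 0.400,
    "glm-5-turbo": 1.20,
}


def input_cost_usd(model: str, input_tokens: int) -> float:
    """Return the input-token cost in USD for one model and a token count."""
    return INPUT_PRICE_PER_M[model] * input_tokens / 1_000_000


if __name__ == "__main__":
    tokens = 250_000_000  # hypothetical monthly input volume (assumption)
    for model in INPUT_PRICE_PER_M:
        print(f"{model}: ${input_cost_usd(model, tokens):,.2f}")
```

At that assumed volume, the gap between the cheapest and most expensive entries is a factor of 20 (1.20 / 0.060), so the choice matters mainly for input-heavy workloads.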

Frequently asked

What is the cheapest Z.ai model?

The cheapest Z.ai model is GLM 4.7 Flash at $0.060 per million input tokens. GLM 4.5 Air ($0.125) and GLM 4.6V ($0.300) round out the top three.

What's a good alternative to GLM 4.7 Flash?

GLM 4.5 Air ($0.125 per million input tokens) is the next cheapest, followed by GLM 4.6V ($0.300). See the full ranking above for the tradeoffs in intelligence, speed and latency.
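
One way to weigh those tradeoffs is to set a minimum intelligence score and take the cheapest model that clears it. The following is a rough sketch rather than anything modelgrep itself provides: the helper name `cheapest_with_min_intel` is hypothetical, and the scores for glm-4.6v and glm-4.5v are read as 23.4 and 15.1, which is an assumption about how the listing separates its tags from its numbers.

```python
from typing import Optional

# (model, input price in USD per million tokens, intelligence score),
# copied from the ranking. The glm-4.6v and glm-4.5v scores are an
# assumed reading of the listing (23.4 and 15.1).
MODELS = [
    ("glm-4.7-flash", 0.060, 30.1),
    ("glm-4.5-air", 0.125, 23.2),
    ("glm-4.6v", 0.300, 23.4),
    ("glm-4.7", 0.400, 42.1),
    ("glm-4.6", 0.430, 30.2),
    ("glm-5", 0.600, 40.6),
    ("glm-4.5v", 0.600, 15.1),
    ("glm-4.5", 0.600, 26.4),
    ("glm-5.1", 0.980, 43.8),
    ("glm-5-turbo", 1.20, 46.8),
]


def cheapest_with_min_intel(min_intel: float) -> Optional[str]:
    """Return the lowest-priced model whose score is at least min_intel."""
    candidates = [m for m in MODELS if m[2] >= min_intel]
    if not candidates:
        return None  # no listed model reaches the requested score
    return min(candidates, key=lambda m: m[1])[0]


if __name__ == "__main__":
    for threshold in (0, 40, 45):
        print(threshold, cheapest_with_min_intel(threshold))
```

The threshold is a single number here for simplicity; a real selection would likely also filter on speed or time to first token.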

How many Z.ai models are there?

modelgrep tracks 10 Z.ai models with live benchmarks, speed, latency and per-provider pricing. GLM 5 Turbo leads on intelligence. All 10 qualify for this ranking.
