modelgrep
Z

Z.ai: GLM 5.2

z-ai/glm-5.2

ReasoningToolsJSON
Use via OpenRouter ↗
Intelligence
Design Elo
1369
Game Dev
Speed
75
98th fastest
Latency
1.6s
first token
Input price
$1.40
231st cheapest
Context
1.0M
262K max out

How it compares

Faster than68%
of all ranked models
Cheaper than22%
of all ranked models

Overview

GLM-5.2 is Z.ai’s flagship model for the era of long-horizon tasks. With a truly usable 1M-token context window, it can handle project-level engineering context, execute long-running tasks more reliably, follow...

Benchmarks

independent · via OpenRouter
Design Arena · Elo1,718 tournaments
Game Dev
1369
3D
1363
codecategories
1360
Website
1357
UI Component
1355
Data Viz
1327

Providers & pricing (6)

ProviderIn $/MOut $/MUptime
Cloudflare$1.40$4.4098.6%
DeepInfrafp8$1.40$4.4077.3%
Z.AIfp8$1.40$4.4099.8%
Io Netfp8$1.40$4.4092.4%
Novitafp8$1.40$4.4098.8%
Friendli$1.40$4.4099.3%

Specifications

Context window1.0M
Max output262K
Knowledge cutoff
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price$0.260/M
ModeratedNo

GLM 5.2 FAQ

How much does GLM 5.2 cost?

GLM 5.2 costs $1.40 per million input tokens and $4.40 per million output tokens via OpenRouter, making it 231st cheapest of 298 paid models.

How fast is GLM 5.2?

GLM 5.2 generates around 75 tokens per second with 1.6s time-to-first-token (p50), the 98th fastest tracked model.

What is GLM 5.2's context window?

GLM 5.2 supports a 1.0M-token context window and can output up to 262K tokens. It accepts text input.

Compare head-to-head