z-ai/glm-5.2
GLM-5.2 is Z.ai’s flagship model for the era of long-horizon tasks. With a truly usable 1M-token context window, it can handle project-level engineering context, execute long-running tasks more reliably, follow...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Cloudflare | $1.40 | $4.40 | 262K | 98.6% |
| DeepInfra (fp8) | $1.40 | $4.40 | 1.0M | 77.3% |
| Z.AI (fp8) | $1.40 | $4.40 | 1.0M | 99.8% |
| Io Net (fp8) | $1.40 | $4.40 | 1.0M | 92.4% |
| Novita (fp8) | $1.40 | $4.40 | 1.0M | 98.8% |
| Friendli | $1.40 | $4.40 | 1.0M | 99.3% |
GLM-5.2 costs $1.40 per million input tokens and $4.40 per million output tokens via OpenRouter, making it the 231st cheapest of 298 paid models.
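As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed prices. The function name and the example token counts are hypothetical, and it assumes flat per-token pricing with no caching discounts or other fees.

```python
# Listed prices in US dollars per million tokens (from the pricing above).
INPUT_PRICE_PER_M = 1.40
OUTPUT_PRICE_PER_M = 4.40


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost of one request in US dollars.

    Assumes flat per-token pricing; any discounts or surcharges are ignored.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 200,000 prompt tokens, 5,000 completion tokens.
    print(f"${estimate_cost(200_000, 5_000):.4f}")
```

At these rates, output tokens cost a little over three times as much as input tokens, so long completions dominate the bill faster than long prompts do.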
GLM-5.2 generates around 75 tokens per second with a 1.6s time-to-first-token (p50), making it the 98th fastest tracked model.
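These two figures can be combined into a back-of-the-envelope latency estimate: time-to-first-token plus output length divided by throughput. The sketch below assumes throughput stays constant for the whole response, which real traffic may not honor; the function name and the example length are hypothetical.

```python
# Median figures quoted above; actual values vary by provider and load.
TTFT_SECONDS = 1.6
TOKENS_PER_SECOND = 75.0


def estimate_latency(output_tokens: int) -> float:
    """Return the approximate seconds needed to receive a full response.

    Simple model: fixed time-to-first-token, then constant-rate generation.
    """
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Hypothetical 1,500-token completion.
    print(f"{estimate_latency(1_500):.1f} s")
```

Under this model the time-to-first-token matters mostly for short replies; for long completions the generation rate dominates.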
GLM-5.2 supports a 1.0M-token context window and can output up to 262K tokens. It accepts text input.
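A quick pre-flight check can catch requests that would not fit these limits. The sketch below rests on assumptions the listing does not confirm: that "1.0M" means 1,000,000 tokens and "262K" means 262,000 tokens (the exact values may differ), and that prompt and output tokens share the same context window. The function name is hypothetical.

```python
# Assumed exact values for the rounded figures in the listing.
CONTEXT_WINDOW = 1_000_000  # assumption: "1.0M" taken as 1,000,000 tokens
MAX_OUTPUT = 262_000        # assumption: "262K" taken as 262,000 tokens


def fits_in_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Return True if the request stays within both assumed limits.

    Assumes the prompt and the requested output share one context window.
    """
    if max_output_tokens > MAX_OUTPUT:
        return False
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(fits_in_context(900_000, 100_000))  # exactly fills the assumed window
    print(fits_in_context(900_000, 100_001))  # one token over the assumed window
```

Note that the provider table lists a smaller context for at least one provider, so the window to check against may depend on where the request is routed.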