z-ai/glm-5.1
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| GMICloudfp8 | $0.980 | $3.08 | 203K | 70.7% |
| Baidufp8 | $0.980 | $3.08 | 203K | 100% |
| Waferfp4 | $1.00 | $3.20 | 203K | 98.6% |
| DeepInfrafp4 | $1.05 | $3.50 | 203K | 99.9% |
| StreamLake | $1.19 | $3.74 | 200K | 99.9% |
| Chutesfp8 | $1.20 | $4.00 | 203K | 96% |
| Phala | $1.21 | $4.20 | 203K | 94.7% |
| AtlasCloudfp8 | $1.26 | $3.96 | 203K | 99.7% |
| BaseTenfp4 | $1.30 | $4.30 | 203K | 2.1% |
| Novitafp8 | $1.38 | $4.40 | 205K | 100% |
| Together | $1.40 | $4.40 | 203K | 96.7% |
| Parasailfp8 | $1.40 | $4.40 | 203K | 100% |
| Fireworks | $1.40 | $4.40 | 203K | 100% |
| Z.AIfp8 | $1.40 | $4.40 | 203K | 100% |
| SiliconFlowfp8 | $1.40 | $4.40 | 205K | 99.8% |
| Ambientfp8 | $1.40 | $4.40 | 203K | 96.5% |
| Friendli | $1.40 | $4.40 | 203K | 99.9% |
| Inceptronfp8 | $1.40 | $4.40 | 203K | 99.1% |
| Venicefp8 | $1.75 | $5.50 | 200K | 100% |
GLM 5.1 costs $0.980 per million input tokens and $3.08 per million output tokens via OpenRouter, making it 203rd cheapest of 298 paid models.
GLM 5.1 scores 43.8 on the Artificial Analysis Intelligence Index, ranking 26th of 180 benchmarked models, with a GPQA Diamond score of 84%.
GLM 5.1 generates around 93 tokens per second with 529ms time-to-first-token (p50), the 74th fastest tracked model.
GLM 5.1 supports a 203K-token context window. It accepts text input.