Z.ai: GLM 4.5

z-ai/glm-4.5

90th smartest of 179ReasoningToolsJSON

Use via OpenRouter ↗

Intelligence

26.4

90th of 179

Design Elo

1251

Speed

207th fastest

Latency

1.4s

first token

Input price

$0.600

176th cheapest

Context

131K

98K max out

How it compares

Smarter than50%

of all ranked models

Faster than33%

of all ranked models

Cheaper than41%

of all ranked models

Overview

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

Benchmarks

independent · via OpenRouter

Artificial Analysis47th percentile

Intelligence Index

26.4

Coding Index

26.3

Agentic Index

16.2

GPQA Diamond

78%

Humanity's Last Exam

12%

SciCode

35%

Tau²-Bench (agentic)

43%

Design Arena · Elo9,161 tournaments

1251

codecategories

1217

Game Dev

1214

Website

1214

Data Viz

1204

UI Component

1201

svg

1153

Providers & pricing (2)

Provider	In $/M	Out $/M	Context	Uptime
Novitafp8	$0.600	$2.20	131K	—
Z.AIfp8	$0.600	$2.20	131K	98%

Specifications

Context window131K

Max output98K

Knowledge cutoffDec 2024

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price$0.110/M

ModeratedNo

Open weightszai-org/GLM-4.5 ↗

GLM 4.5 FAQ

How much does GLM 4.5 cost?

GLM 4.5 costs $0.600 per million input tokens and $2.20 per million output tokens via OpenRouter, making it 176th cheapest of 298 paid models.

How smart is GLM 4.5?

GLM 4.5 scores 26.4 on the Artificial Analysis Intelligence Index, ranking 90th of 179 benchmarked models, with a GPQA Diamond score of 78%.

How fast is GLM 4.5?

GLM 4.5 generates around 42 tokens per second with 1.4s time-to-first-token (p50), the 207th fastest tracked model.

What is GLM 4.5's context window?

GLM 4.5 supports a 131K-token context window and can output up to 98K tokens. It accepts text input.

Similar models

All GLM 4.5 alternatives →

openai/gpt-4.1

Intel 26.3$2.00/M

inclusionai/ling-2.6-flash

Intel 26.2$0.010/M

qwen/qwen3-max

Intel 26.1$0.780/M

qwen/qwen3-next-80b-a3b-thinking

Compare head-to-head

Z.ai: GLM 4.5 vs OpenAI: GPT-4.1 Z.ai: GLM 4.5 vs inclusionAI: Ling-2.6-flash Z.ai: GLM 4.5 vs Qwen: Qwen3 Max Z.ai: GLM 4.5 vs Qwen: Qwen3 Next 80B A3B Thinking Z.ai: GLM 4.5 vs Upstage: Solar Pro 3 Z.ai: GLM 4.5 vs OpenAI: GPT-5 Nano