modelgrep
Z

Z.ai: GLM 5V Turbo

z-ai/glm-5v-turbo

ReasoningToolsJSONVision
Use via OpenRouter ↗
Intelligence
Design Elo
1291
3D
Speed
76
114th fastest
Latency
1.7s
first token
Input price
$1.20
217th cheapest
Context
203K
131K max out

How it compares

Faster than63%
of all ranked models
Cheaper than28%
of all ranked models

Overview

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...

Benchmarks

independent · via OpenRouter
Artificial Analysis
GPQA Diamond
81%
Humanity's Last Exam
16%
SciCode
44%
Tau²-Bench (agentic)
99%
Design Arena · Elo0 tournaments
3D
1291
Game Dev
1289
codecategories
1279
Website
1270
androidnative
1267
UI Component
1267
Data Viz
1247
mobileapps
1223
godotgamedev
1221
fullstack
1214
svg
1204
webapps
1188
agenticslides(python-pptx)
1183
agenticslides
1171
python-pptxslides
1167
pptxslides
1164
asciiart
1138
agenticslides(html)
1137
agentichtmlslides
1134
htmlslides
1134
agenticgamedev
1128

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Z.AIfp8$1.20$4.00

Specifications

Context window203K
Max output131K
Knowledge cutoff
Input modalitiesimage, text, video
Output modalitiestext
Prompt caching
Cache read price$0.240/M
ModeratedNo

GLM 5V Turbo FAQ

How much does GLM 5V Turbo cost?

GLM 5V Turbo costs $1.20 per million input tokens and $4.00 per million output tokens via OpenRouter, making it 217th cheapest of 301 paid models.

How fast is GLM 5V Turbo?

GLM 5V Turbo generates around 76 tokens per second with 1.7s time-to-first-token (p50), the 114th fastest tracked model.

What is GLM 5V Turbo's context window?

GLM 5V Turbo supports a 203K-token context window and can output up to 131K tokens. It accepts image, text, video input.

Compare head-to-head