Z.ai: GLM 4.5V

z-ai/glm-4.5v

142nd smartest of 178ReasoningToolsJSONVision

Use via OpenRouter ↗

Intelligence

15.1

142nd of 178

Design Elo

—

Speed

195th fastest

Latency

1.1s

first token

Input price

$0.600

175th cheapest

Context

66K

16K max out

How it compares

Smarter than20%

of all ranked models

Faster than34%

of all ranked models

Cheaper than41%

of all ranked models

Overview

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

Benchmarks

independent · via OpenRouter

Artificial Analysis21th percentile

Intelligence Index

15.1

Coding Index

10.9

Agentic Index

10.9

GPQA Diamond

68%

Humanity's Last Exam

SciCode

22%

Tau²-Bench (agentic)

23%

Providers & pricing (2)

Provider	In $/M	Out $/M	Context	Uptime
Novitafp8	$0.600	$1.80	66K	—
Z.AIfp8cache	$0.600	$1.80	66K	—

Specifications

Context window66K

Max output16K

Knowledge cutoffDec 2024

Input modalitiestext, image

Output modalitiestext

Prompt cachingSupported

Cache read price$0.110/M

ModeratedNo

Open weightszai-org/GLM-4.5V ↗

GLM 4.5V FAQ

How much does GLM 4.5V cost?

GLM 4.5V costs $0.600 per million input tokens and $1.80 per million output tokens via OpenRouter, making it 175th cheapest of 298 paid models.

How smart is GLM 4.5V?

GLM 4.5V scores 15.1 on the Artificial Analysis Intelligence Index, ranking 142nd of 178 benchmarked models, with a GPQA Diamond score of 68%.

How fast is GLM 4.5V?

GLM 4.5V generates around 43 tokens per second with 1.1s time-to-first-token (p50), the 195th fastest tracked model.

What is GLM 4.5V's context window?

GLM 4.5V supports a 66K-token context window and can output up to 16K tokens. It accepts text, image input.

Similar models

All GLM 4.5V alternatives →

qwen/qwen3-30b-a3b-instruct-2507

Intel 15.0$0.048/M

nvidia/nemotron-nano-12b-v2-vl:free

Intel 14.9Free/M

qwen/qwen3-30b-a3b

Intel 15.3$0.120/M

mistralai/ministral-8b-2512

Intel 14.8$0.150/M

nvidia/nemotron-nano-9b-v2:free

Intel 14.8Free/M

meta-llama/llama-3.3-70b-instruct:free

Intel 14.5Free/M

Compare head-to-head

Z.ai: GLM 4.5V vs Qwen: Qwen3 30B A3B Instruct 2507 Z.ai: GLM 4.5V vs NVIDIA: Nemotron Nano 12B 2 VL (free)Z.ai: GLM 4.5V vs Qwen: Qwen3 30B A3B Z.ai: GLM 4.5V vs Mistral: Ministral 3 8B 2512 Z.ai: GLM 4.5V vs NVIDIA: Nemotron Nano 9B V2 (free)Z.ai: GLM 4.5V vs Meta: Llama 3.3 70B Instruct (free)