Qwen: Qwen3 VL 32B Instruct

qwen/qwen3-vl-32b-instruct

150th smartest of 182Cheaper than 82% of paidToolsJSONVision

Use via OpenRouter ↗

Intelligence

11.1

150th of 182

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.104

60th cheapest

Context

131K

33K max out

How it compares

Smarter than18%

of all ranked models

Cheaper than82%

of all ranked models

Overview

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Benchmarks

independent · Artificial Analysis & Design Arena

Artificial Analysis39th percentile

Intelligence Index

11.1

GPQA Diamond

67%

Humanity's Last Exam

SciCode

30%

Tau²-Bench (agentic)

29%

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Alibabafp8	$0.104	$0.416	131K	100%

Specifications

Context window131K

Max output33K

Knowledge cutoff—

Input modalitiestext, image

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsQwen/Qwen3-VL-32B-Instruct ↗

Qwen3 VL 32B Instruct FAQ

How much does Qwen3 VL 32B Instruct cost?

Qwen3 VL 32B Instruct costs $0.104 per million input tokens and $0.416 per million output tokens via OpenRouter, making it 60th cheapest of 332 paid models.

How smart is Qwen3 VL 32B Instruct?

Qwen3 VL 32B Instruct scores 11.1 on the Artificial Analysis Intelligence Index, ranking 150th of 182 benchmarked models, with a GPQA Diamond score of 67%.