Qwen: Qwen3 32B

qwen/qwen3-32b

Cheaper than 89% of paidReasoningToolsJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.080

37th cheapest

Context

131K

16K max out

How it compares

Cheaper than89%

of all ranked models

Overview

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

Providers & pricing (5)

Provider	In $/M	Out $/M	Context	Uptime
DeepInfrafp8	$0.080	$0.280	41K	100%
Nebiusfp8	$0.100	$0.300	41K	99.8%
Alibabafp8	$0.104	$0.416	131K	0%
SiliconFlowfp8	$0.140	$0.570	131K	98.7%
Groq	$0.290	$0.590	131K	100%

Specifications

Context window131K

Max output16K

Knowledge cutoffMar 2025

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsQwen/Qwen3-32B ↗

Qwen3 32B FAQ

How much does Qwen3 32B cost?

Qwen3 32B costs $0.080 per million input tokens and $0.280 per million output tokens via OpenRouter, making it 37th cheapest of 332 paid models.

What is Qwen3 32B's context window?

Qwen3 32B supports a 131K-token context window and can output up to 16K tokens. It accepts text input.

More from qwen

All Qwen3 32B alternatives →

qwen/qwen3.5-plus-20260420

Compare head-to-head

Qwen: Qwen3 32B vs Qwen: Qwen3.7 Flash Qwen: Qwen3 32B vs Qwen: Qwen3.7 Plus Qwen: Qwen3 32B vs Qwen: Qwen3.7 Max Qwen: Qwen3 32B vs Qwen: Qwen3.5 Plus 2026-04-20 Qwen: Qwen3 32B vs Qwen: Qwen3.6 Flash Qwen: Qwen3 32B vs Qwen: Qwen3.6 35B A3B