Qwen: Qwen3 30B A3B

qwen/qwen3-30b-a3b

Cheaper than 81% of paidReasoningToolsJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

997

Data Viz

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.120

64th cheapest

Context

131K

16K max out

How it compares

Cheaper than81%

of all ranked models

Overview

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

Benchmarks

independent · Artificial Analysis & Design Arena

Design Arena · Elo100 tournaments

Data Viz

997

Website

979

Providers & pricing (2)

Provider	In $/M	Out $/M	Context	Uptime
DeepInfrafp8	$0.120	$0.500	41K	100%
Alibabafp8	$0.130	$0.520	131K	100%

Specifications

Context window131K

Max output16K

Knowledge cutoffMar 2025

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsQwen/Qwen3-30B-A3B ↗

Qwen3 30B A3B FAQ

How much does Qwen3 30B A3B cost?

Qwen3 30B A3B costs $0.120 per million input tokens and $0.500 per million output tokens via OpenRouter, making it 64th cheapest of 332 paid models.

What is Qwen3 30B A3B's context window?

Qwen3 30B A3B supports a 131K-token context window and can output up to 16K tokens. It accepts text input.

More from qwen

All Qwen3 30B A3B alternatives →

qwen/qwen3.5-plus-20260420

Compare head-to-head

Qwen: Qwen3 30B A3B vs Qwen: Qwen3.7 Flash Qwen: Qwen3 30B A3B vs Qwen: Qwen3.7 Plus Qwen: Qwen3 30B A3B vs Qwen: Qwen3.7 Max Qwen: Qwen3 30B A3B vs Qwen: Qwen3.5 Plus 2026-04-20 Qwen: Qwen3 30B A3B vs Qwen: Qwen3.6 Flash Qwen: Qwen3 30B A3B vs Qwen: Qwen3.6 35B A3B