Qwen: Qwen3 30B A3B Instruct 2507

qwen/qwen3-30b-a3b-instruct-2507

Cheaper than 96% of paid models

Tools

JSON

Intelligence: —

Design Elo: —

Speed: — (tokens/sec)

Latency: — (first token)

Input price: $0.048 (13th cheapest)

Context: 262K (32K max out)

How it compares

Cheaper than 96% of all ranked models

Overview

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and...

Providers & pricing (5)

Provider	Quantization	In $/M	Out $/M	Context	Uptime
StreamLake	—	$0.048	$0.193	128K	100%
SiliconFlow	fp8	$0.090	$0.300	262K	96.9%
Nebius	fp8	$0.100	$0.300	262K	98.6%
CoreWeave	bf16	$0.100	$0.300	262K	100%
Alibaba	fp8	$0.130	$0.520	131K	100%
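
The listed providers differ in both price and context length, so the cheapest one is not always usable for a long prompt. The sketch below shows one way to read the table: filter by required context, then pick the lowest cost for a given request. The figures are copied from the table; the `Provider` class, the `cheapest_provider` helper, the reading of "K" as 1,000 tokens, and the assumption that input and output tokens both count against the context length are illustrative choices for this example, not part of any OpenRouter API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Provider:
    name: str
    in_price: float   # USD per million input tokens
    out_price: float  # USD per million output tokens
    context: int      # context length in tokens ("K" read as 1,000: an assumption)


# Values copied from the providers table.
PROVIDERS = [
    Provider("StreamLake", 0.048, 0.193, 128_000),
    Provider("SiliconFlow", 0.090, 0.300, 262_000),
    Provider("Nebius", 0.100, 0.300, 262_000),
    Provider("CoreWeave", 0.100, 0.300, 262_000),
    Provider("Alibaba", 0.130, 0.520, 131_000),
]


def cheapest_provider(input_tokens: int, output_tokens: int) -> Optional[Provider]:
    """Return the lowest-cost provider whose context fits the request, or None."""
    # Assumption: prompt and completion tokens share the context window.
    needed = input_tokens + output_tokens
    eligible = [p for p in PROVIDERS if p.context >= needed]
    if not eligible:
        return None
    return min(
        eligible,
        key=lambda p: input_tokens * p.in_price + output_tokens * p.out_price,
    )


short = cheapest_provider(10_000, 1_000)    # fits every provider
long = cheapest_provider(200_000, 2_000)    # only the 262K providers qualify
print(short.name, long.name)
```

With these figures, a short request resolves to StreamLake, while a 200K-token prompt falls back to the cheapest 262K provider. Uptime and quantization are ignored here and may matter in practice.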

Specifications

Context window: 262K

Max output: 32K

Knowledge cutoff: Jun 2025

Input modalities: text

Output modalities: text

Prompt caching: —

Cache read price: —

Moderated: No

Open weights: Qwen/Qwen3-30B-A3B-Instruct-2507

Qwen3 30B A3B Instruct 2507 FAQ

How much does Qwen3 30B A3B Instruct 2507 cost?

Qwen3 30B A3B Instruct 2507 costs $0.048 per million input tokens and $0.193 per million output tokens via OpenRouter, making it the 13th cheapest of 332 paid models.
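
To make the per-million pricing concrete, here is a minimal sketch that converts token counts into an estimated dollar cost at the prices quoted above. It also shows one plausible reading of how a rank of 13 out of 332 relates to the "cheaper than 96%" figure. The function names are hypothetical, and the page does not state how the percentage is actually derived.

```python
INPUT_PRICE_PER_M = 0.048   # USD per million input tokens (from the answer above)
OUTPUT_PRICE_PER_M = 0.193  # USD per million output tokens (from the answer above)


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the quoted prices."""
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000


def share_more_expensive(rank: int, total: int) -> float:
    """Fraction of models ranked below `rank` when sorted cheapest first.

    Assumption: this is how a "cheaper than N%" figure could be computed.
    """
    return (total - rank) / total


# Example: a 4,000-token prompt with a 500-token reply.
print(f"${estimate_cost(4_000, 500):.6f}")
print(f"{share_more_expensive(13, 332):.1%}")
```

Actual charges depend on which provider serves the request, since the other listed providers charge more per token.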