Qwen: Qwen3 8B

qwen/qwen3-8b

Cheaper than 81% of paid models · Reasoning · Tools · JSON

Use via OpenRouter ↗

Intelligence: n/a

Design Elo: n/a

Speed: n/a (tokens/sec)

Latency: n/a (first token)

Input price: $0.117 per million tokens (62nd cheapest)

Context: 131K (8K max output)

How it compares

Cheaper than 81% of all ranked models

Overview

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...

Providers & pricing (1)

Provider	Quantization	Input $/M tokens	Output $/M tokens	Context	Uptime
Alibaba	fp8	$0.117	$0.455	131K	100%

Specifications

Context window: 131K

Max output: 8K

Knowledge cutoff: Mar 2025

Input modalities: text

Output modalities: text

Prompt caching: n/a

Cache read price: n/a

Moderated: No

Open weights: Qwen/Qwen3-8B

Qwen3 8B FAQ

How much does Qwen3 8B cost?

Qwen3 8B costs $0.117 per million input tokens and $0.455 per million output tokens via OpenRouter, making it the 62nd cheapest of 332 paid models.
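To turn the per-million rates above into a per-request figure, a minimal sketch follows. The two prices are copied from this listing; the function name `estimate_cost_usd` and the example token counts are illustrative assumptions, and actual billing may differ (for example, if the provider changes its rates).

```python
# Rates copied from the listing above, in USD per million tokens.
INPUT_PRICE_PER_M = 0.117
OUTPUT_PRICE_PER_M = 0.455


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed rates.

    Hypothetical helper for illustration; not part of any official SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(100_000, 2_000):.5f}")
```

At these rates, a 100,000-token prompt with a 2,000-token reply comes to roughly $0.0126, with input tokens accounting for most of it.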

What is Qwen3 8B's context window?

Qwen3 8B supports a 131K-token context window and can output up to 8K tokens. It accepts text input and produces text output.
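A rough pre-flight check against these limits might look like the sketch below. It rests on assumptions not confirmed by this page: that "131K" and "8K" can be treated as 131,000 and 8,000 tokens (the exact limits may be slightly higher), and that output tokens count against the same context window as the prompt. The function names are hypothetical.

```python
# Nominal limits taken from the listing; the exact values are an assumption.
CONTEXT_WINDOW_TOKENS = 131_000  # listed as "131K"
MAX_OUTPUT_TOKENS = 8_000        # listed as "8K"


def fits_budget(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the nominal limits.

    Assumes prompt and output share one context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


def max_prompt_tokens(requested_output_tokens: int) -> int:
    """Largest prompt that still leaves room for the requested output."""
    if not 0 <= requested_output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError("requested output exceeds the nominal output cap")
    return CONTEXT_WINDOW_TOKENS - requested_output_tokens


if __name__ == "__main__":
    print(fits_budget(120_000, 8_000))
    print(max_prompt_tokens(8_000))
```

Using the rounded-down figures keeps the check conservative: a request that passes here should also pass if the true limits turn out to be a little larger.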

More from qwen


qwen/qwen3.5-plus-20260420

Compare head-to-head

Qwen: Qwen3 8B vs Qwen: Qwen3.7 Flash
Qwen: Qwen3 8B vs Qwen: Qwen3.7 Plus
Qwen: Qwen3 8B vs Qwen: Qwen3.7 Max
Qwen: Qwen3 8B vs Qwen: Qwen3.5 Plus 2026-04-20
Qwen: Qwen3 8B vs Qwen: Qwen3.6 Flash
Qwen: Qwen3 8B vs Qwen: Qwen3.6 35B A3B