qwen/qwen3-8b
Qwen3-8B is a dense 8.2B-parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| AtlasCloud (fp8) | $0.050 | $0.400 | 41K | 99.8% |
| Alibaba | $0.117 | $0.455 | 131K | 99.9% |
Qwen3 8B starts at $0.050 per million input tokens and $0.400 per million output tokens via OpenRouter, making it the 18th cheapest of 298 paid models.
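As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed per-million prices. The function name `estimate_cost` and the example token counts are hypothetical. Actual billing may differ by provider and is not confirmed here.

```python
# Prices in USD per million tokens, taken from the lowest-priced row listed above.
INPUT_PRICE_PER_M = 0.050
OUTPUT_PRICE_PER_M = 0.400


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed rates (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 2,000-token prompt with a 500-token reply (illustrative numbers).
cost = estimate_cost(input_tokens=2_000, output_tokens=500)
print(f"Estimated cost: ${cost:.6f}")
```

Output tokens cost eight times as much as input tokens at these rates, so long replies dominate the bill.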
Qwen3 8B scores 10.6 on the Artificial Analysis Intelligence Index, ranking 164th of 178 benchmarked models, with a GPQA Diamond score of 45%.
Qwen3 8B generates around 29 tokens per second with a 672 ms time-to-first-token (p50), making it the 240th fastest tracked model.
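To put the speed figures in practical terms, a back-of-the-envelope estimate adds the time-to-first-token to the generation time at the listed throughput. This is a minimal sketch that assumes throughput stays constant over the whole reply, which real endpoints do not guarantee. The helper name `estimate_latency` is hypothetical.

```python
# Median figures from the listing above.
TTFT_SECONDS = 0.672        # time-to-first-token (p50)
TOKENS_PER_SECOND = 29.0    # approximate generation throughput


def estimate_latency(output_tokens: int) -> float:
    """Rough wall-clock seconds for a reply, assuming constant throughput."""
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Example: a 500-token reply (illustrative size).
print(f"Estimated latency: {estimate_latency(500):.1f} s")
```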
Qwen3 8B supports a context window of up to 131K tokens, depending on the provider, and can output up to 8K tokens. It accepts text input.
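A simple pre-flight check against these limits might look like the sketch below. It assumes the prompt and the reply share one context window, as is typical for causal language models, and it uses the rounded figures from the listing (131K and 8K) rather than exact limits, which are not stated here. The function name `fits_in_context` is hypothetical.

```python
# Rounded limits from the listing; exact token limits are an assumption.
CONTEXT_LIMIT = 131_000
MAX_OUTPUT_TOKENS = 8_000


def fits_in_context(prompt_tokens: int, max_output_tokens: int,
                    context_limit: int = CONTEXT_LIMIT) -> bool:
    """Return True if the request stays within the listed limits."""
    if max_output_tokens > MAX_OUTPUT_TOKENS:
        return False  # reply budget exceeds the listed output cap
    # Assumes prompt and reply are counted against the same window.
    return prompt_tokens + max_output_tokens <= context_limit


print(fits_in_context(100_000, 8_000))
print(fits_in_context(40_000, 4_000, context_limit=41_000))
```

Passing `context_limit=41_000` models a provider that serves a smaller window, such as the 41K row in the table.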