qwen/qwen3-max-thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Alibaba | $0.780 | $3.90 | 262K | — |
Qwen3 Max Thinking costs $0.780 per million input tokens and $3.90 per million output tokens via OpenRouter, making it 190th cheapest of 298 paid models.
Qwen3 Max Thinking scores 39.8 on the Artificial Analysis Intelligence Index, ranking 41st of 180 benchmarked models, with a GPQA Diamond score of 86%.
Qwen3 Max Thinking generates around 40 tokens per second with 1.1s time-to-first-token (p50), the 202nd fastest tracked model.
Qwen3 Max Thinking supports a 262K-token context window and can output up to 33K tokens. It accepts text input.