qwen/qwen3-vl-8b-thinking
Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Alibaba | $0.117 | $1.36 | 131K | — |
Qwen3 VL 8B Thinking costs $0.117 per million input tokens and $1.36 per million output tokens via OpenRouter, making it 65th cheapest of 298 paid models.
Qwen3 VL 8B Thinking scores 16.7 on the Artificial Analysis Intelligence Index, ranking 136th of 178 benchmarked models, with a GPQA Diamond score of 58%.
Qwen3 VL 8B Thinking generates around 126 tokens per second with 483ms time-to-first-token (p50), the 41st fastest tracked model.
Qwen3 VL 8B Thinking supports a 256K-token context window and can output up to 33K tokens. It accepts image, text input.