qwen/qwen3-14b
Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| NextBitint4 | $0.100 | $0.240 | 41K | 96.6% |
| DeepInfrafp8 | $0.120 | $0.240 | 41K | 100% |
| Alibaba | $0.228 | $0.910 | 131K | — |
Qwen3 14B costs $0.100 per million input tokens and $0.240 per million output tokens via OpenRouter, making it 58th cheapest of 298 paid models.
Qwen3 14B scores 16.2 on the Artificial Analysis Intelligence Index, ranking 136th of 179 benchmarked models, with a GPQA Diamond score of 60%.
Qwen3 14B generates around 66 tokens per second with 349ms time-to-first-token (p50), the 114th fastest tracked model.
Qwen3 14B supports a 132K-token context window and can output up to 41K tokens. It accepts text input.