qwen/qwen3.5-27b
The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...
| Provider | Input ($/M tokens) | Output ($/M tokens) | Context | Uptime |
|---|---|---|---|---|
| Alibaba | $0.195 | $1.56 | 262K | 95.6% |
| SiliconFlow (fp8) | $0.250 | $2.00 | 262K | 94.4% |
| DeepInfra (fp8) | $0.260 | $2.60 | 262K | 99.9% |
| AtlasCloud (fp8) | $0.270 | $2.16 | 262K | 99% |
| Phala | $0.300 | $2.40 | 262K | 45.3% |
| Novita (bf16) | $0.300 | $2.40 | 262K | 92.6% |
Via OpenRouter, Qwen3.5-27B starts at $0.195 per million input tokens and $1.56 per million output tokens (the Alibaba rate, the lowest listed above), making it the 92nd cheapest of 298 paid models.
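To make the per-million pricing concrete, here is a minimal sketch of the cost arithmetic for a single request. The prices are copied from the Alibaba row of the table above; the function name `estimate_cost` and the example token counts are hypothetical, chosen only for illustration, and actual billing may differ by provider.

```python
# Prices in USD per million tokens, taken from the Alibaba row of the table.
INPUT_PRICE_PER_M = 0.195
OUTPUT_PRICE_PER_M = 1.56


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = INPUT_PRICE_PER_M,
    output_price_per_m: float = OUTPUT_PRICE_PER_M,
) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
    cost = estimate_cost(10_000, 2_000)
    print(f"Estimated cost: ${cost:.5f}")
```

Passing another provider's prices as the last two arguments gives the same estimate for that row, for example `estimate_cost(10_000, 2_000, 0.260, 2.60)` for DeepInfra.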
Qwen3.5-27B scores 37.2 on the Artificial Analysis Intelligence Index, ranking 54th of 178 benchmarked models, with a GPQA Diamond score of 84%.
Qwen3.5-27B generates around 45 tokens per second with a median (p50) time to first token of 207 ms, making it the 187th fastest tracked model.
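The throughput and time-to-first-token figures can be combined into a rough end-to-end latency estimate. The sketch below assumes a simplified model in which the response takes the time to first token plus the output length divided by a constant generation rate; real latency varies with provider, load, and prompt length. The function name `estimate_response_seconds` is hypothetical.

```python
# Figures quoted above: ~45 tokens/s and 207 ms median time to first token.
TOKENS_PER_SECOND = 45.0
TTFT_SECONDS = 0.207


def estimate_response_seconds(
    output_tokens: int,
    tokens_per_second: float = TOKENS_PER_SECOND,
    ttft_seconds: float = TTFT_SECONDS,
) -> float:
    """Rough wall-clock estimate: time to first token plus steady-rate generation."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_seconds + output_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical 500-token reply.
    seconds = estimate_response_seconds(500)
    print(f"Estimated response time: {seconds:.1f} s")
```

Under these assumptions a 500-token reply takes a little over 11 seconds, almost all of it spent generating tokens after the first one arrives.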
Qwen3.5-27B supports a 262K-token context window and can output up to 66K tokens. It accepts text, image, and video input.