qwen/qwen3.5-flash-02-23
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...
| Provider | Input ($/M tokens) | Output ($/M tokens) | Context | Uptime |
|---|---|---|---|---|
| Alibaba | $0.065 | $0.260 | 1M | 100% |
Qwen3.5-Flash costs $0.065 per million input tokens and $0.260 per million output tokens via OpenRouter, making it the 29th cheapest of 298 paid models.
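As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed prices. The function name `estimate_cost_usd` and the sample token counts are hypothetical, and the calculation assumes flat per-token pricing with no caching discounts, tiering, or extra fees.

```python
# Listed rates from the pricing table, in USD per million tokens.
INPUT_USD_PER_M = 0.065
OUTPUT_USD_PER_M = 0.260


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming flat per-token pricing."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 10,000 output tokens.
    print(f"${estimate_cost_usd(100_000, 10_000):.4f}")
```

Because output tokens cost four times as much as input tokens at these rates, long generations tend to dominate the bill even when the prompt is large.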
Qwen3.5-Flash generates around 85 tokens per second with a 626 ms time-to-first-token (p50), making it the 83rd fastest tracked model.
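To get a feel for end-to-end response time, the two figures can be combined into a simple estimate: time to first token plus the output length divided by throughput. This is a minimal sketch that treats both numbers as typical values and assumes a steady generation rate after the first token; the function name `estimate_latency_seconds` is hypothetical, and real latency varies with load and prompt size.

```python
# Listed performance figures, treated here as typical values.
TTFT_SECONDS = 0.626       # time-to-first-token
TOKENS_PER_SECOND = 85.0   # approximate generation throughput


def estimate_latency_seconds(output_tokens: int) -> float:
    """Estimate total response time, assuming steady throughput after the first token."""
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Hypothetical response lengths, in output tokens.
    for n in (100, 850, 5_000):
        print(f"{n} tokens: about {estimate_latency_seconds(n):.1f} s")
```

Under these assumptions, time-to-first-token matters mostly for short replies; for longer outputs the throughput term dominates.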
Qwen3.5-Flash supports a 1M-token context window and can output up to 66K tokens. It accepts text, image, and video input.