Qwen: Qwen3.5-Flash

qwen/qwen3.5-flash-02-23

Cheaper than 91% of paidReasoningToolsJSONVision

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.065

29th cheapest

Context

66K max out

How it compares

Cheaper than91%

of all ranked models

Overview

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Alibabafp8	$0.065	$0.260	1M	100%

Specifications

Context window1M

Max output66K

Knowledge cutoff—

Input modalitiestext, image, video

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Qwen3.5-Flash FAQ

How much does Qwen3.5-Flash cost?

Qwen3.5-Flash costs $0.065 per million input tokens and $0.260 per million output tokens via OpenRouter, making it 29th cheapest of 332 paid models.

What is Qwen3.5-Flash's context window?

Qwen3.5-Flash supports a 1M-token context window and can output up to 66K tokens. It accepts text, image, video input.

More from qwen

All Qwen3.5-Flash alternatives →

qwen/qwen3.5-plus-20260420

Compare head-to-head

Qwen: Qwen3.5-Flash vs Qwen: Qwen3.7 Flash Qwen: Qwen3.5-Flash vs Qwen: Qwen3.7 Plus Qwen: Qwen3.5-Flash vs Qwen: Qwen3.7 Max Qwen: Qwen3.5-Flash vs Qwen: Qwen3.5 Plus 2026-04-20 Qwen: Qwen3.5-Flash vs Qwen: Qwen3.6 Flash Qwen: Qwen3.5-Flash vs Qwen: Qwen3.6 35B A3B