modelgrep

Qwen: Qwen3.6 Flash

qwen/qwen3.6-flash

Capabilities: Reasoning, Tools, JSON, Vision
Intelligence
Design Elo
Speed: 99 tokens/s (64th fastest)
Latency: 747ms to first token
Input price: $0.188 per million tokens (91st cheapest)
Context: 1M tokens (66K max output)

How it compares

Faster than 79% of all ranked models
Cheaper than 69% of all ranked models

Overview

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...

Providers & pricing (1)

Provider   In $/M   Out $/M   Uptime
Alibaba    $0.188   $1.13     100%

Specifications

Context window      1M
Max output          66K
Knowledge cutoff
Input modalities    text, image, video
Output modalities   text
Prompt caching
Cache read price
Moderated           No

Qwen3.6 Flash FAQ

How much does Qwen3.6 Flash cost?

Qwen3.6 Flash costs $0.188 per million input tokens and $1.13 per million output tokens via OpenRouter, making it the 91st cheapest of 298 paid models.
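
To turn the per-million prices into a per-request figure, a rough sketch in Python follows. It assumes a flat rate at the listed prices; the overview mentions tiered pricing, whose thresholds are not given here, so long prompts may cost more than this estimate. The function name `estimate_cost_usd` is hypothetical and not part of any API.

```python
# Listed OpenRouter prices from this page, in USD per million tokens.
INPUT_PRICE_PER_M = 0.188
OUTPUT_PRICE_PER_M = 1.13


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Flat-rate cost estimate (assumption: ignores tiered pricing and caching)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 4,000-token reply.
    print(f"${estimate_cost_usd(200_000, 4_000):.5f}")
```

Under these assumptions, a 200,000-token prompt with a 4,000-token reply comes to roughly $0.042.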

How fast is Qwen3.6 Flash?

Qwen3.6 Flash generates around 99 tokens per second with 747ms time-to-first-token (p50), the 64th fastest tracked model.
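
As a back-of-the-envelope illustration, the two figures above can be combined into an approximate response time: time to first token plus output length divided by throughput. This is only a sketch that treats the p50 numbers as constants; real latency varies with load, prompt length, and reasoning tokens. The function name `estimate_response_seconds` is hypothetical.

```python
# Median figures listed on this page (assumed constant for this sketch).
TOKENS_PER_SECOND = 99.0
TTFT_SECONDS = 0.747


def estimate_response_seconds(output_tokens: int) -> float:
    """Approximate wall-clock time to stream a reply of the given length."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Example: a 990-token reply.
    print(f"{estimate_response_seconds(990):.3f} s")
```

With these numbers, a 990-token reply would take about 10.7 seconds end to end.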

What is Qwen3.6 Flash's context window?

Qwen3.6 Flash supports a 1M-token context window and can output up to 66K tokens. It accepts text, image, and video input.
