modelgrep
Q

Qwen: Qwen3 Coder Flash

qwen/qwen3-coder-flash

ToolsJSON
Use via OpenRouter ↗
Intelligence
Design Elo
Speed
40
201st fastest
Latency
1.4s
first token
Input price
$0.195
93rd cheapest
Context
1M
66K max out

How it compares

Faster than32%
of all ranked models
Cheaper than69%
of all ranked models

Overview

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Alibaba$0.195$0.975100%

Specifications

Context window1M
Max output66K
Knowledge cutoffJun 2025
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price$0.039/M
ModeratedNo

Qwen3 Coder Flash FAQ

How much does Qwen3 Coder Flash cost?

Qwen3 Coder Flash costs $0.195 per million input tokens and $0.975 per million output tokens via OpenRouter, making it 93rd cheapest of 298 paid models.

How fast is Qwen3 Coder Flash?

Qwen3 Coder Flash generates around 40 tokens per second with 1.4s time-to-first-token (p50), the 201st fastest tracked model.

What is Qwen3 Coder Flash's context window?

Qwen3 Coder Flash supports a 1M-token context window and can output up to 66K tokens. It accepts text input.

Compare head-to-head