modelgrep
Q

Qwen: Qwen-Plus

qwen/qwen-plus

ToolsJSON
Use via OpenRouter ↗
Intelligence
Design Elo
Speed
51
156th fastest
Latency
433ms
first token
Input price
$0.260
122nd cheapest
Context
1M
33K max out

How it compares

Faster than47%
of all ranked models
Cheaper than59%
of all ranked models

Overview

Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced performance, speed, and cost combination.

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Alibaba$0.260$0.780100%

Specifications

Context window1M
Max output33K
Knowledge cutoffMar 2025
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price$0.052/M
ModeratedNo

Qwen-Plus FAQ

How much does Qwen-Plus cost?

Qwen-Plus costs $0.260 per million input tokens and $0.780 per million output tokens via OpenRouter, making it 122nd cheapest of 298 paid models.

How fast is Qwen-Plus?

Qwen-Plus generates around 51 tokens per second with 433ms time-to-first-token (p50), the 156th fastest tracked model.

What is Qwen-Plus's context window?

Qwen-Plus supports a 1M-token context window and can output up to 33K tokens. It accepts text input.

Compare head-to-head