modelgrep
Qwen: Qwen3.5-27B

qwen/qwen3.5-27b

54th smartest of 178 · Reasoning · Tools · JSON · Vision
Intelligence: 37.2 (54th of 178)
Design Elo: not listed
Speed: 45 tokens per second (187th fastest)
Latency: 207ms to first token
Input price: $0.195 per million tokens (92nd cheapest)
Context: 262K (66K max output)

How it compares

Smarter than 70% of all ranked models
Faster than 37% of all ranked models
Cheaper than 69% of all ranked models
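
The comparison percentages appear to follow from the rank figures quoted elsewhere on this page (54th of 178 for intelligence, 92nd cheapest of 298 paid models). The sketch below assumes the share is simply the fraction of models ranked below this one; that formula is an assumption, not a documented method, and the function name `share_outranked` is hypothetical.

```python
def share_outranked(rank: int, total: int) -> float:
    """Fraction of ranked models placed below `rank`, where rank 1 is best.

    Assumed formula: (total - rank) / total.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return (total - rank) / total


# Ranks and totals as quoted on this page.
smarter = share_outranked(54, 178)
cheaper = share_outranked(92, 298)

print(f"Smarter than {smarter:.0%} of ranked models")
print(f"Cheaper than {cheaper:.0%} of paid models")
```

Under this assumption the rounded results match the 70% and 69% figures shown above. The speed figure cannot be checked the same way because the page does not state how many models are tracked for speed.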

Overview

Qwen3.5 27B is a dense, native vision-language model that incorporates a linear attention mechanism, aiming to balance inference speed and output quality. Its overall capabilities are comparable to those of...

Benchmarks

independent · via OpenRouter
Artificial Analysis (67th percentile)
Intelligence Index: 37.2
Coding Index: 33.4
Agentic Index: 51.5
GPQA Diamond: 84%
Humanity's Last Exam: 13%
SciCode: 37%
Tau²-Bench (agentic): 87%

Providers & pricing (6)

Provider     Quantization  In $/M  Out $/M  Uptime
Alibaba      -             $0.195  $1.56    95.6%
SiliconFlow  fp8           $0.250  $2.00    94.4%
DeepInfra    fp8           $0.260  $2.60    99.9%
AtlasCloud   fp8           $0.270  $2.16    99%
Phala        -             $0.300  $2.40    45.3%
Novita       bf16          $0.300  $2.40    92.6%
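
The provider prices and uptime figures listed above can be combined to pick a provider for a given workload. The following is a minimal sketch, assuming cost is linear in token counts and that uptime is used as a simple cut-off; the `Provider` class and the `request_cost` and `cheapest` helpers are hypothetical names introduced for illustration, not part of any provider's API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Provider:
    name: str
    input_per_m: float   # USD per million input tokens
    output_per_m: float  # USD per million output tokens
    uptime_pct: float


# Values copied from the provider listing on this page.
PROVIDERS = [
    Provider("Alibaba", 0.195, 1.56, 95.6),
    Provider("SiliconFlow", 0.250, 2.00, 94.4),
    Provider("DeepInfra", 0.260, 2.60, 99.9),
    Provider("AtlasCloud", 0.270, 2.16, 99.0),
    Provider("Phala", 0.300, 2.40, 45.3),
    Provider("Novita", 0.300, 2.40, 92.6),
]


def request_cost(p: Provider, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request, assuming purely per-token pricing."""
    return (input_tokens * p.input_per_m + output_tokens * p.output_per_m) / 1_000_000


def cheapest(providers, input_tokens: int, output_tokens: int,
             min_uptime_pct: float) -> Optional[Provider]:
    """Cheapest provider whose uptime meets the threshold, or None."""
    eligible = [p for p in providers if p.uptime_pct >= min_uptime_pct]
    if not eligible:
        return None
    return min(eligible, key=lambda p: request_cost(p, input_tokens, output_tokens))


# Example workload: 10,000 input tokens and 1,000 output tokens per request.
best = cheapest(PROVIDERS, 10_000, 1_000, min_uptime_pct=99.5)
print(best.name if best else "no provider meets the uptime threshold")
```

Because output tokens cost roughly eight to ten times more than input tokens at every listed provider, the ranking can change with the input-to-output ratio of the workload.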

Specifications

Context window: 262K
Max output: 66K
Knowledge cutoff: not listed
Input modalities: text, image, video
Output modalities: text
Prompt caching: not listed
Cache read price: not listed
Moderated: No

Qwen3.5-27B FAQ

How much does Qwen3.5-27B cost?

Qwen3.5-27B costs $0.195 per million input tokens and $1.56 per million output tokens via OpenRouter, making it the 92nd cheapest of 298 paid models.

How smart is Qwen3.5-27B?

Qwen3.5-27B scores 37.2 on the Artificial Analysis Intelligence Index, ranking 54th of 178 benchmarked models, with a GPQA Diamond score of 84%.

How fast is Qwen3.5-27B?

Qwen3.5-27B generates around 45 tokens per second with 207ms time-to-first-token (p50), the 187th fastest tracked model.
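
As a rough guide, the two figures above can be combined into an end-to-end estimate for a response of a given length. This sketch assumes a constant generation rate after the first token and ignores queueing, provider differences, and any hidden reasoning tokens; the function name `estimated_seconds` is hypothetical.

```python
def estimated_seconds(output_tokens: int,
                      tokens_per_second: float = 45.0,
                      ttft_ms: float = 207.0) -> float:
    """Approximate wall-clock time: time to first token plus generation time.

    Defaults are the median figures quoted on this page.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


for n in (100, 1_000, 10_000):
    print(f"{n:>6} tokens: about {estimated_seconds(n):.1f} s")
```

For anything longer than a few dozen tokens the generation rate dominates, so the 207ms first-token latency matters mainly for short, interactive replies.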

What is Qwen3.5-27B's context window?

Qwen3.5-27B supports a 262K-token context window and can output up to 66K tokens. It accepts text, image, and video input.
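
When planning long prompts, it can help to reserve room for the reply. The sketch below assumes that output tokens count against the same context window as the prompt, which is common but not confirmed on this page, and it uses the rounded 262K and 66K figures rather than exact limits. The constant and function names are hypothetical.

```python
# Rounded figures from this page; the exact limits may differ slightly.
CONTEXT_WINDOW = 262_000
MAX_OUTPUT = 66_000


def max_prompt_tokens(reserved_output: int) -> int:
    """Largest prompt that still leaves `reserved_output` tokens for the reply.

    Assumes prompt and output share one context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError(f"reserved_output must be between 0 and {MAX_OUTPUT}")
    return CONTEXT_WINDOW - reserved_output


print(max_prompt_tokens(4_000))       # room left for the prompt with a short reply
print(max_prompt_tokens(MAX_OUTPUT))  # room left when reserving the full output limit
```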
