modelgrep

Qwen: Qwen3.5-122B-A10B

qwen/qwen3.5-122b-a10b

60th smartest of 178 · Reasoning · Tools · JSON · Vision
Intelligence: 35.9 (60th of 178)
Design Elo: not listed
Speed: 93 tokens per second (73rd fastest)
Latency: 787ms to first token
Input price: $0.260 per million tokens (117th cheapest)
Context: 262K (262K max output)

How it compares

Smarter than 66% of all ranked models
Faster than 75% of all ranked models
Cheaper than 61% of all ranked models
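
The percentages above appear to follow from the rank figures quoted elsewhere on this page. A minimal sketch, assuming the share is computed as the fraction of ranked models placed below this one (the site's actual formula is not stated, and `share_outranked` is a hypothetical helper name):

```python
def share_outranked(rank: int, total: int) -> float:
    """Percentage of ranked models placed below a 1-based rank (assumed formula)."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100 * (total - rank) / total


# Ranks quoted on this page: 60th of 178 on intelligence, 117th of 298 on price.
smarter_than = share_outranked(60, 178)
cheaper_than = share_outranked(117, 298)

print(f"Smarter than {smarter_than:.0f}% of ranked models")
print(f"Cheaper than {cheaper_than:.0f}% of ranked models")
```

Under this assumption the two rounded values agree with the 66% and 61% shown above. The speed figure cannot be checked the same way because the page does not say how many models are tracked for speed.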

Overview

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. In terms of...

Benchmarks

independent · via OpenRouter
Artificial Analysis · 65th percentile
Intelligence Index: 35.9
Coding Index: 31.6
Agentic Index: 49.5
GPQA Diamond: 83%
Humanity's Last Exam: 15%
SciCode: 36%
Tau²-Bench (agentic): 85%

Providers & pricing (4)

Provider            In $/M    Out $/M   Uptime
SiliconFlow (fp8)   $0.260    $2.08     100%
Alibaba             $0.260    $2.08     100%
AtlasCloud (fp8)    $0.300    $2.40     100%
Novita (bf16)       $0.400    $3.20     100%
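
To see what the price differences mean for a real request, the listed rates can be applied to a token count. The following is a rough sketch using the four providers' prices from the table above. The workload size (10,000 input tokens, 1,000 output tokens) and the names `PROVIDERS` and `request_cost` are illustrative assumptions, and the calculation ignores any caching discounts or fees the page does not list.

```python
# (input $/M tokens, output $/M tokens), copied from the provider table.
PROVIDERS = {
    "SiliconFlow": (0.260, 2.08),
    "Alibaba": (0.260, 2.08),
    "AtlasCloud": (0.300, 2.40),
    "Novita": (0.400, 3.20),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request at the provider's listed per-million rates."""
    in_price, out_price = PROVIDERS[provider]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Hypothetical workload: 10,000 prompt tokens and 1,000 completion tokens.
costs = {name: request_cost(name, 10_000, 1_000) for name in PROVIDERS}

for name, cost in sorted(costs.items(), key=lambda item: item[1]):
    print(f"{name:<12} ${cost:.5f}")
```

Because output tokens cost eight times as much as input tokens at every listed provider, the output length tends to dominate the bill for generation-heavy workloads.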

Specifications

Context window: 262K
Max output: 262K
Knowledge cutoff: not listed
Input modalities: text, image, video
Output modalities: text
Prompt caching: not listed
Cache read price: not listed
Moderated: No

Qwen3.5-122B-A10B FAQ

How much does Qwen3.5-122B-A10B cost?

Qwen3.5-122B-A10B costs $0.260 per million input tokens and $2.08 per million output tokens via OpenRouter, making it 117th cheapest of 298 paid models.

How smart is Qwen3.5-122B-A10B?

Qwen3.5-122B-A10B scores 35.9 on the Artificial Analysis Intelligence Index, ranking 60th of 178 benchmarked models, with a GPQA Diamond score of 83%.

How fast is Qwen3.5-122B-A10B?

Qwen3.5-122B-A10B generates around 93 tokens per second with a 787ms time-to-first-token (p50), making it the 73rd fastest tracked model.
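
These two figures can be combined into a rough end-to-end estimate for a streamed response. This is a back-of-the-envelope sketch, assuming throughput stays constant at the quoted rate after the first token; real requests vary with provider, load, and prompt length, and `estimate_seconds` is a hypothetical helper.

```python
TTFT_SECONDS = 0.787      # time-to-first-token (p50) quoted on this page
TOKENS_PER_SECOND = 93.0  # approximate generation speed quoted on this page


def estimate_seconds(output_tokens: int) -> float:
    """Approximate wall-clock time to receive a full response."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


for n in (100, 500, 2_000):
    print(f"{n:>5} tokens: about {estimate_seconds(n):.1f} s")
```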

What is Qwen3.5-122B-A10B's context window?

Qwen3.5-122B-A10B supports a 262K-token context window and can output up to 262K tokens. It accepts text, image, and video input.
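
When the context window and the maximum output are the same size, a long prompt leaves less room for the reply. The sketch below shows one way to budget for that. It rests on two assumptions the page does not confirm: that "262K" means 262,144 tokens, and that prompt and output tokens share a single window. `remaining_output_budget` is a hypothetical helper.

```python
CONTEXT_WINDOW = 262_144  # assumption: "262K" read as 2**18 tokens
MAX_OUTPUT = 262_144      # assumption: same reading for the output limit


def remaining_output_budget(prompt_tokens: int) -> int:
    """Largest output length that still fits, assuming a shared window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt does not fit in the context window")
    return min(MAX_OUTPUT, CONTEXT_WINDOW - prompt_tokens)


# A 200,000-token prompt leaves at most 62,144 tokens for the reply.
print(remaining_output_budget(200_000))
```

Token counts depend on the model's tokenizer, so a count taken with a different tokenizer is only an approximation.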
