modelgrep
Q

Qwen2.5 Coder 32B Instruct

qwen/qwen-2.5-coder-32b-instruct

Use via OpenRouter ↗
Intelligence
Design Elo
Speed
14
278th fastest
Latency
414ms
first token
Input price
$0.660
181st cheapest
Context
128K
33K max out

How it compares

Faster than6%
of all ranked models
Cheaper than39%
of all ranked models

Overview

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significantly improvements in **code generation**, **code reasoning**...

Benchmarks

independent · via OpenRouter
Artificial Analysis
GPQA Diamond
42%
Humanity's Last Exam
4%
SciCode
27%

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Cloudflare$0.660$1.00

Specifications

Context window128K
Max output33K
Knowledge cutoffJun 2024
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price
ModeratedNo

Qwen2.5 Coder 32B Instruct FAQ

How much does Qwen2.5 Coder 32B Instruct cost?

Qwen2.5 Coder 32B Instruct costs $0.660 per million input tokens and $1.00 per million output tokens via OpenRouter, making it 181st cheapest of 298 paid models.

How fast is Qwen2.5 Coder 32B Instruct?

Qwen2.5 Coder 32B Instruct generates around 14 tokens per second with 414ms time-to-first-token (p50), the 278th fastest tracked model.

What is Qwen2.5 Coder 32B Instruct's context window?

Qwen2.5 Coder 32B Instruct supports a 128K-token context window and can output up to 33K tokens. It accepts text input.

Compare head-to-head