modelgrep
Q

Qwen: Qwen2.5 VL 72B Instruct

qwen/qwen2.5-vl-72b-instruct

JSONVision
Use via OpenRouter ↗
Intelligence
Design Elo
Speed
19
271st fastest
Latency
1.2s
first token
Input price
$0.800
195th cheapest
Context
131K
128K max out

How it compares

Faster than9%
of all ranked models
Cheaper than35%
of all ranked models

Overview

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Parasailfp8$0.800$1.00100%

Specifications

Context window131K
Max output128K
Knowledge cutoffJun 2024
Input modalitiestext, image
Output modalitiestext
Prompt caching
Cache read price$0.400/M
ModeratedNo

Qwen2.5 VL 72B Instruct FAQ

How much does Qwen2.5 VL 72B Instruct cost?

Qwen2.5 VL 72B Instruct costs $0.800 per million input tokens and $1.00 per million output tokens via OpenRouter, making it 195th cheapest of 298 paid models.

How fast is Qwen2.5 VL 72B Instruct?

Qwen2.5 VL 72B Instruct generates around 19 tokens per second with 1.2s time-to-first-token (p50), the 271st fastest tracked model.

What is Qwen2.5 VL 72B Instruct's context window?

Qwen2.5 VL 72B Instruct supports a 131K-token context window and can output up to 128K tokens. It accepts text, image input.

Compare head-to-head