Qwen: Qwen2.5 7B Instruct

qwen/qwen-2.5-7b-instruct

Cheaper than 82% of paidToolsJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.100

59th cheapest

Context

33K

33K max out

How it compares

Cheaper than82%

of all ranked models

Overview

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

Providers & pricing (2)

Provider	In $/M	Out $/M	Context	Uptime
Phala	$0.100	$0.200	33K	100%
Togetherfp8	$0.300	$0.300	33K	100%

Specifications

Context window33K

Max output33K

Knowledge cutoffJun 2024

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsQwen/Qwen2.5-7B-Instruct ↗

Qwen2.5 7B Instruct FAQ

How much does Qwen2.5 7B Instruct cost?

Qwen2.5 7B Instruct costs $0.100 per million input tokens and $0.200 per million output tokens via OpenRouter, making it 59th cheapest of 332 paid models.