qwen/qwen-2.5-7b-instruct
Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Phala | $0.040 | $0.100 | 33K | 92.1% |
| Togetherfp8 | $0.300 | $0.300 | 33K | 99.9% |
Qwen2.5 7B Instruct costs $0.040 per million input tokens and $0.100 per million output tokens via OpenRouter, making it 11th cheapest of 298 paid models.
Qwen2.5 7B Instruct generates around 65 tokens per second with 353ms time-to-first-token (p50), the 127th fastest tracked model.
Qwen2.5 7B Instruct supports a 131K-token context window and can output up to 33K tokens. It accepts text input.