Magnum v4 72B

anthracite-org/magnum-v4-72b

JSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$3.00

300th cheapest

Context

16K

2K max out

How it compares

Cheaper than10%

of all ranked models

Overview

This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Mancer 2fp8	$3.00	$5.00	16K	—

Specifications

Context window16K

Max output2K

Knowledge cutoffJun 2024

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsanthracite-org/magnum-v4-72b ↗

Magnum v4 72B FAQ

How much does Magnum v4 72B cost?

Magnum v4 72B costs $3.00 per million input tokens and $5.00 per million output tokens via OpenRouter, making it 300th cheapest of 332 paid models.

What is Magnum v4 72B's context window?

Magnum v4 72B supports a 16K-token context window and can output up to 2K tokens. It accepts text input.