anthracite-org/magnum-v4-72b
This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Mancer 2fp8 | $3.00 | $5.00 | 16K | — |
Magnum v4 72B costs $3.00 per million input tokens and $5.00 per million output tokens via OpenRouter, making it 270th cheapest of 298 paid models.
Magnum v4 72B generates around 26 tokens per second with 1.2s time-to-first-token (p50), the 250th fastest tracked model.
Magnum v4 72B supports a 33K-token context window and can output up to 2K tokens. It accepts text input.