Lowest-Latency Anthracite-org Models

Quick answer · Updated June 2026

Magnum v4 72B has the lowest latency of any Anthracite-org model, responding in about 1.5s to first token.

Latency: 1.5s

Speed: 30 t/s

Input price: $3.00 per 1M tokens

Context: 33K tokens
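To see how the latency and speed figures combine, a rough estimate of total response time is the time to first token plus the output length divided by throughput. The sketch below assumes that simple model; it ignores network overhead, queueing, and any variation in throughput, and the function name and the 300-token output length are illustrative choices, not something this page defines.

```python
def estimate_response_seconds(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Approximate wall-clock time: wait for the first token, then stream the rest.

    Assumes constant throughput after the first token (a simplification).
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


# Latency and speed as quoted on this page for Magnum v4 72B;
# the 300-token reply length is a hypothetical example.
estimate = estimate_response_seconds(ttft_s=1.5, tokens_per_s=30.0, output_tokens=300)
print(f"{estimate:.1f}s")  # 1.5 + 300 / 30 = 11.5
```

For short replies the 1.5s wait dominates; for long replies the 30 t/s throughput does, which is why latency and speed are reported as separate figures.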

Anthracite-org models ranked by median (p50) time to first token: the most responsive large language models for real-time and interactive use cases.

1. magnum-v4-72b · JSON · $3.00/M · 30 t/s · 33K ctx · 1.5s latency
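A p50 ranking like this one can be sketched as taking the median of each model's time-to-first-token samples and sorting ascending. The code below is a minimal illustration under that assumption, not modelgrep's actual pipeline; the model names and sample values are made up, and models with no samples are simply excluded.

```python
from statistics import median


def rank_by_p50_ttft(samples: dict[str, list[float]]) -> list[tuple[str, float]]:
    """Return (model, p50 time to first token in seconds), lowest latency first.

    Models with no measurements are skipped rather than ranked.
    """
    ranked = [(model, median(values)) for model, values in samples.items() if values]
    return sorted(ranked, key=lambda pair: pair[1])


# Hypothetical measurements in seconds, for illustration only.
samples = {
    "model-a": [1.4, 1.5, 1.9],
    "model-b": [2.0, 2.2, 5.0],
    "model-c": [],  # no data, so it does not qualify
}
ranking = rank_by_p50_ttft(samples)
for position, (model, p50) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {p50:.1f}s")
```

Using the median rather than the mean keeps a single slow outlier (such as the 5.0s sample for the hypothetical model-b) from distorting the ranking.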

Frequently asked

Which Anthracite-org model has the lowest latency?

Magnum v4 72B has the lowest latency of any Anthracite-org model, responding in about 1.5s to first token.

How many Anthracite-org models are there?

modelgrep tracks 1 Anthracite-org model with live benchmarks, speed, latency, and per-provider pricing. That model qualifies for this ranking.
