modelgrep

Lowest-Latency Anthracite-org Models

Quick answer · Updated June 2026

Magnum v4 72B has the lowest latency of any Anthracite-org model, responding in about 1.5s to first token.

Latency: 1.5s
Speed: 30 t/s
Input: $3.00/M
Context: 33K

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

  1. magnum-v4-72b
    JSON · $3.00/M · 30 t/s · 33K ctx
    Latency: 1.5s
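
A ranking by p50 time-to-first-token can be sketched as follows. This is a minimal illustration, not modelgrep's actual pipeline: the model names, the sample values, and the function `rank_by_p50_ttft` are all hypothetical, and the p50 is assumed to be the plain median of per-request measurements in seconds.

```python
from statistics import median

# Hypothetical time-to-first-token samples in seconds, one list per model.
# These values are made up for illustration; they are not modelgrep data.
samples = {
    "model-a": [1.4, 1.5, 1.7, 1.5, 2.9],
    "model-b": [0.9, 2.2, 2.4, 2.1, 2.3],
}

def rank_by_p50_ttft(samples):
    """Return (model, p50) pairs sorted from lowest to highest median TTFT."""
    # Models with no measurements are skipped, since a median is undefined.
    p50 = {name: median(values) for name, values in samples.items() if values}
    return sorted(p50.items(), key=lambda item: item[1])

ranking = rank_by_p50_ttft(samples)
for position, (name, latency) in enumerate(ranking, start=1):
    print(f"{position}. {name}: {latency:.1f}s")
```

Using the median rather than the mean keeps a single slow request (the 2.9s sample for model-a) from changing a model's position in the ranking.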

Frequently asked

Which Anthracite-org model has the lowest latency?

Magnum v4 72B has the lowest latency of any Anthracite-org model, responding in about 1.5s to first token.

How many Anthracite-org models are there?

modelgrep tracks 1 Anthracite-org model with live benchmarks, speed, latency and per-provider pricing. It qualifies for this ranking.
