modelgrep

Lowest-Latency Prime Intellect Models

Quick answer · Updated June 2026

INTELLECT-3 has the lowest latency of any Prime Intellect model, responding in about 210ms to first token.

210msLatency
22.2Intelligence
30 t/sSpeed
$0.200Input /M
131KContext

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

  1. 1P
    intellect-3
    ReasoningToolsJSON22.2 intel · $0.200/M · 30 t/s
    210ms
    Latency

Frequently asked

Which Prime Intellect model has the lowest latency?

INTELLECT-3 has the lowest latency of any Prime Intellect model, responding in about 210ms to first token.

How many Prime Intellect models are there?

modelgrep tracks 1 Prime Intellect models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by INTELLECT-3. 1 of them qualify for this ranking.

More Prime Intellect rankings

All rankings