Lowest-Latency Prime Intellect Models

Quick answer · Updated June 2026

INTELLECT-3 has the lowest latency of any Prime Intellect model, responding in about 210ms to first token.

Latency: 210ms

Intelligence: 22.2

Speed: 30 t/s

Input /M: $0.200

Context: 131K

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.
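As a rough illustration of what a p50 time-to-first-token ranking involves, here is a minimal sketch that takes the median of per-request latency samples and sorts models by it. The function names, model names, and sample values are hypothetical and are not modelgrep's actual method or data.

```python
from statistics import median


def p50_ttft_ms(samples_ms):
    """Return the median (p50) time-to-first-token, in milliseconds."""
    if not samples_ms:
        raise ValueError("need at least one latency sample")
    return median(samples_ms)


def rank_by_latency(models):
    """Return (name, p50) pairs sorted from lowest to highest p50 latency."""
    scored = ((name, p50_ttft_ms(samples)) for name, samples in models.items())
    return sorted(scored, key=lambda pair: pair[1])


# Hypothetical per-request TTFT samples in milliseconds (not real measurements).
samples = {
    "model-a": [190, 210, 205, 400, 220],
    "model-b": [300, 310, 290],
}

ranking = rank_by_latency(samples)
for position, (name, p50) in enumerate(ranking, start=1):
    print(f"{position}. {name}: {p50} ms")
```

Using the median rather than the mean keeps a single slow request (the 400 ms sample above) from distorting the figure.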

1. intellect-3
Reasoning · Tools · JSON · 22.2 intel · $0.200/M · 30 t/s
Latency: 210ms

Frequently asked

Which Prime Intellect model has the lowest latency?

INTELLECT-3 has the lowest latency of any Prime Intellect model, responding in about 210ms to first token.

How many Prime Intellect models are there?

modelgrep tracks 1 Prime Intellect model, INTELLECT-3, with live benchmarks, speed, latency and per-provider pricing. It qualifies for this ranking.
