Lowest-Latency Relace Models

Quick answer · Updated June 2026

Relace Search has the lowest latency of any Relace model, responding in about 767ms to first token.

767msLatency

8 t/sSpeed

$1.00Input /M

256KContext

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

1R
relace-search
Tools$1.00/M · 8 t/s · 256K ctx
767ms
Latency

Frequently asked

Which Relace model has the lowest latency?

Relace Search has the lowest latency of any Relace model, responding in about 767ms to first token.

How many Relace models are there?

modelgrep tracks 2 Relace models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.

More Relace rankings

Relace: Smartest LLMs Relace: Best LLMs for Coding Relace: Best LLMs for Design & Frontend Relace: Fastest LLMs Relace: Cheapest LLMs Relace: Best Free LLMs Relace: Best Reasoning LLMs Relace: Best Vision LLMs Relace: Best LLMs for Agents Relace: Best Open-Source LLMs Relace: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Fastest LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs