modelgrep

Lowest-Latency Relace Models

Quick answer · Updated June 2026

Relace Search has the lowest latency of any Relace model, responding in about 767ms to first token.

Latency: 767ms
Speed: 8 t/s
Input: $1.00/M
Context: 256K

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

  1. relace-search (Tools) · $1.00/M · 8 t/s · 256K ctx · 767ms latency
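
For readers who want to reproduce a ranking of this kind, here is a minimal sketch of ordering models by median (p50) time to first token. It is not modelgrep's actual pipeline: the function name `rank_by_p50_ttft`, the model names, and the sample values are all hypothetical and made up for illustration.

```python
from statistics import median


def rank_by_p50_ttft(samples_ms):
    """Return (model, p50_ms) pairs sorted from fastest to slowest.

    samples_ms maps a model name to a list of time-to-first-token
    measurements in milliseconds. Models with no samples are skipped.
    """
    p50 = {model: median(values) for model, values in samples_ms.items() if values}
    return sorted(p50.items(), key=lambda item: item[1])


# Made-up measurements for two hypothetical models (not modelgrep data).
samples_ms = {
    "model-a": [910.0, 640.0, 780.0],
    "model-b": [450.0, 520.0, 2300.0],
}

ranking = rank_by_p50_ttft(samples_ms)
for position, (model, latency) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {latency:.0f}ms")
```

Because the median is used, the single slow 2300ms sample for the hypothetical model-b does not move it down the list, which is one reason p50 is a common choice for describing typical responsiveness.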

Frequently asked

Which Relace model has the lowest latency?

Relace Search has the lowest latency of any Relace model, responding in about 767ms to first token.

How many Relace models are there?

modelgrep tracks 2 Relace models with live benchmarks, speed, latency, and per-provider pricing. Only 1 of them qualifies for this ranking.
