modelgrep

Lowest-Latency Poolside Models

Quick answer · Updated June 2026

Laguna XS.2 (free) has the lowest latency of any Poolside model, with a median time to first token of about 457 ms. Laguna M.1 (free) is next at about 2.3 s.

Latency: 457 ms
Speed: 114 t/s
Input price per M tokens: Free
Context: 262K

AI models ranked by median (p50) time to first token, most responsive first. Low time to first token matters most for real-time and interactive use cases.

  1. laguna-xs.2:free
    Reasoning · Tools · Free/M · 114 t/s · 262K ctx
    Latency: 457 ms
  2. laguna-m.1:free
    Reasoning · Tools · Free/M · 26 t/s · 262K ctx
    Latency: 2.3 s
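
The ranking above orders models by their median (p50) time to first token. As a rough sketch of how such an ordering could be computed from raw measurements, the snippet below takes the median of each model's samples and sorts ascending. The function name `rank_by_p50_ttft` and the sample values are hypothetical and chosen for illustration only; they are not modelgrep's actual measurements or method.

```python
from statistics import median


def rank_by_p50_ttft(samples):
    """Return (model, p50_ms) pairs sorted from lowest to highest median TTFT.

    `samples` maps a model name to a list of time-to-first-token
    measurements in milliseconds. Models with no samples are skipped.
    """
    p50 = {model: median(values) for model, values in samples.items() if values}
    return sorted(p50.items(), key=lambda item: item[1])


# Hypothetical TTFT samples in milliseconds (assumed, not real measurements).
samples = {
    "laguna-m.1:free": [2100, 2300, 2600],
    "laguna-xs.2:free": [430, 457, 510],
}

ranking = rank_by_p50_ttft(samples)
for rank, (model, p50_ms) in enumerate(ranking, start=1):
    print(f"{rank}. {model}: {p50_ms} ms")
```

Using the median rather than the mean keeps a single slow request from dominating a model's position, which is presumably why the ranking reports p50.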

Frequently asked

Which Poolside model has the lowest latency?

Laguna XS.2 (free) has the lowest latency of any Poolside model, with a median time to first token of about 457 ms. Laguna M.1 (free) is next at about 2.3 s.

What's a good alternative to Laguna XS.2 (free)?

Laguna M.1 (free), at about 2.3 s to first token, is the closest alternative on this metric. See the full ranking above for the tradeoffs.

How many Poolside models are there?

modelgrep tracks two Poolside models with live benchmarks, speed, latency, and per-provider pricing. Both qualify for this ranking.
