modelgrep

Lowest-Latency Sakana Models

Quick answer · Updated June 2026

Fugu Ultra has the lowest latency of any Sakana model, responding in about 7.6s to first token.

Latency: 7.6s
Speed: 44 t/s
Input price: $5.00/M
Context: 1M

AI models ranked by median (p50) time to first token, lowest first. The ranking targets real-time and interactive use cases.

  1. fugu-ultra
     Reasoning · Tools · JSON · +1
     $5.00/M · 44 t/s · 1M ctx
     Latency: 7.6s
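
As a rough illustration of how a ranking by p50 time to first token can be computed, the sketch below takes the median of a set of latency samples per model and sorts ascending. The model names and sample values are hypothetical placeholders, not modelgrep measurements, and modelgrep's actual sampling and aggregation method is not stated on this page.

```python
from statistics import median

# Hypothetical time-to-first-token samples in seconds (illustrative only,
# not real measurements for any model).
ttft_samples = {
    "model-a": [7.1, 7.6, 8.4, 7.5, 9.0],
    "model-b": [3.2, 2.9, 3.5],
}


def rank_by_p50_ttft(samples):
    """Return (model, p50) pairs sorted from lowest to highest median TTFT."""
    # Skip models with no samples, since a median is undefined for them.
    p50 = {model: median(values) for model, values in samples.items() if values}
    return sorted(p50.items(), key=lambda item: item[1])


ranking = rank_by_p50_ttft(ttft_samples)
for position, (model, latency) in enumerate(ranking, start=1):
    print(f"{position}. {model}: {latency:.1f}s")
```

Using the median rather than the mean keeps a few very slow requests from dominating the figure, which is one common reason latency is reported as p50.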

Frequently asked

Which Sakana model has the lowest latency?

Fugu Ultra has the lowest latency of any Sakana model, responding in about 7.6s to first token.

How many Sakana models are there?

modelgrep tracks 1 Sakana model with live benchmarks, speed, latency and per-provider pricing. It qualifies for this ranking.
