Lowest-Latency Sakana Models

Quick answer · Updated June 2026

Fugu Ultra has the lowest latency of any Sakana model, responding in about 7.6s to first token.

Latency: 7.6s

Speed: 44 t/s

Input: $5.00/M

Context: 1M

Sakana models ranked by median (p50) time to first token: the most responsive large language models for real-time and interactive use cases.
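As a rough illustration of what a p50 time-to-first-token ranking involves, the sketch below takes the median of several latency samples per model and sorts models from fastest to slowest. The function names, model names, and sample values are all hypothetical; this is not modelgrep's actual methodology or data.

```python
from statistics import median


def p50_ttft(samples_s):
    """Return the median (p50) time to first token, in seconds."""
    if not samples_s:
        raise ValueError("need at least one latency sample")
    return median(samples_s)


def rank_by_latency(samples_by_model):
    """Sort model names by ascending p50 time to first token."""
    return sorted(
        samples_by_model,
        key=lambda name: p50_ttft(samples_by_model[name]),
    )


# Hypothetical measurements in seconds, for illustration only
# (not real benchmark data).
samples = {
    "model-a": [1.2, 0.9, 1.5],
    "model-b": [0.4, 0.7, 0.5],
}
ranking = rank_by_latency(samples)
```

The median is used rather than the mean so that a single slow request does not dominate the reported figure.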

1. fugu-ultra
Reasoning · Tools · JSON · +1
$5.00/M · 44 t/s · 1M ctx
Latency: 7.6s

Frequently asked

Which Sakana model has the lowest latency?

Fugu Ultra has the lowest latency of any Sakana model, responding in about 7.6s to first token.

How many Sakana models are there?

modelgrep tracks 1 Sakana model with live benchmarks, speed, latency and per-provider pricing. It qualifies for this ranking.
