modelgrep

Best Sakana Vision Models

Quick answer · Updated June 2026

Fugu Ultra is the best vision-capable Sakana model, pairing intelligence with image and document understanding.

Intelligence: n/a
Speed: 44 t/s
Input: $5.00 /M
Context: 1M

Multimodal large language models that accept image input, ranked by intelligence. The best vision-capable AI models for understanding images, documents and charts.

  1. fugu-ultra
    Reasoning · Tools · JSON · +1
    $5.00/M · 44 t/s · 7.6s ttft
    Intelligence: n/a

Frequently asked

What is the best Sakana model for vision?

Fugu Ultra is the best vision-capable Sakana model, pairing intelligence with image and document understanding.

How many Sakana models are there?

modelgrep tracks 1 Sakana model with live benchmarks, speed, latency and per-provider pricing, and it qualifies for this ranking.
