modelgrep

Lowest-Latency Gryphe Models

Quick answer · Updated June 2026

MythoMax 13B has the lowest latency of any Gryphe model, responding in about 252ms to first token.

252msLatency
43 t/sSpeed
$0.060Input /M
4KContext

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

  1. 1G
    mythomax-l2-13b
    JSON$0.060/M · 43 t/s · 4K ctx
    252ms
    Latency

Frequently asked

Which Gryphe model has the lowest latency?

MythoMax 13B has the lowest latency of any Gryphe model, responding in about 252ms to first token.

How many Gryphe models are there?

modelgrep tracks 1 Gryphe models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.

More Gryphe rankings

All rankings