modelgrep

Lowest-Latency Morph Models

Quick answer · Updated June 2026

Morph V3 Large has the lowest latency of any Morph model, responding in about 408ms to first token.

408msLatency
2.7k t/sSpeed
$0.900Input /M
262KContext

AI models ranked by time-to-first-token (p50). The most responsive large language models for real-time and interactive use cases.

  1. 1M
    morph-v3-large
    $0.900/M · 2.7k t/s · 262K ctx
    408ms
    Latency

Frequently asked

Which Morph model has the lowest latency?

Morph V3 Large has the lowest latency of any Morph model, responding in about 408ms to first token.

How many Morph models are there?

modelgrep tracks 2 Morph models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.

More Morph rankings

All rankings