modelgrep

Lowest-Latency Xiaomi Models

Quick answer · Updated June 2026

MiMo-V2.5-Pro has the lowest latency of any Xiaomi model, responding in about 198ms to first token. MiMo-V2-Flash (536ms) and MiMo-V2.5 (2.4s) round out the top three.

Latency: 198ms
Intelligence: 53.8
Speed: 31 t/s
Input price: $0.435/M
Context: 1.0M

Xiaomi models ranked by median (p50) time to first token, most responsive first. Lower values suit real-time and interactive use cases.

  1. mimo-v2.5-pro
     Reasoning, Tools, JSON · 53.8 intel · $0.435/M · 31 t/s
     Latency: 198ms
  2. mimo-v2-flash
     Reasoning, Tools, JSON · 30.3 intel · $0.100/M · 87 t/s
     Latency: 536ms
  3. mimo-v2.5
     Reasoning, Tools, JSON, +2 more · 49.0 intel · $0.140/M · 44 t/s
     Latency: 2.4s
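As a rough illustration of how a ranking like this could be produced, the sketch below sorts the three models by the p50 values listed above. The `p50_ms` helper and the dictionary layout are assumptions made for the example, not modelgrep's actual pipeline, and the page does not publish the raw per-request samples a median would be computed from.

```python
from statistics import median


def p50_ms(samples_ms):
    """Return the median (p50) of time-to-first-token samples, in milliseconds.

    Hypothetical helper: it would be applied to raw measurements,
    which are not available on this page.
    """
    return median(samples_ms)


# p50 time-to-first-token values as listed in the ranking above (milliseconds).
ttft_p50_ms = {
    "mimo-v2.5": 2400,
    "mimo-v2.5-pro": 198,
    "mimo-v2-flash": 536,
}

# Lower time to first token ranks higher.
ranking = sorted(ttft_p50_ms, key=ttft_p50_ms.get)

for rank, model in enumerate(ranking, start=1):
    print(f"{rank}. {model}: {ttft_p50_ms[model]} ms")
```

The median is used rather than the mean so that a few unusually slow requests do not dominate the figure.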

Frequently asked

Which Xiaomi model has the lowest latency?

MiMo-V2.5-Pro has the lowest latency of any Xiaomi model, responding in about 198ms to first token. MiMo-V2-Flash (536ms) and MiMo-V2.5 (2.4s) round out the top three.

What's a good alternative to MiMo-V2.5-Pro?

MiMo-V2-Flash (536ms) is the closest alternative on this metric, followed by MiMo-V2.5 (2.4s). See the full ranking above for the tradeoffs.
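One way to weigh these tradeoffs is to estimate the total time for a full response, not just the first token. The sketch below assumes a deliberately simple model (time to first token plus output tokens divided by the listed speed) and a hypothetical 500-token response; real timings will vary with provider, load, and prompt length.

```python
# Figures from the ranking above:
# (p50 time to first token in seconds, output speed in tokens per second).
models = {
    "mimo-v2.5-pro": (0.198, 31),
    "mimo-v2-flash": (0.536, 87),
    "mimo-v2.5": (2.4, 44),
}

OUTPUT_TOKENS = 500  # hypothetical response length, chosen for illustration


def estimated_total_seconds(ttft_s, tokens_per_s, output_tokens):
    """Assumed model: wait for the first token, then stream at a constant speed."""
    return ttft_s + output_tokens / tokens_per_s


estimates = {
    name: estimated_total_seconds(ttft_s, speed, OUTPUT_TOKENS)
    for name, (ttft_s, speed) in models.items()
}

# Print models from fastest to slowest estimated completion.
for name, seconds in sorted(estimates.items(), key=lambda item: item[1]):
    print(f"{name}: ~{seconds:.1f} s")
```

Under these assumptions the model with the lowest latency is not the first to finish a long response, because its output speed is the lowest of the three. For short replies the ordering shifts back toward the latency ranking.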

How many Xiaomi models are there?

modelgrep tracks 3 Xiaomi models with live benchmarks, speed, latency, and per-provider pricing, led on intelligence by MiMo-V2.5-Pro. All 3 qualify for this ranking.
