modelgrep

Fastest MiniMax Models

Quick answer · Updated June 2026

The fastest MiniMax model is MiniMax M2.7 at 273 output tokens per second. MiniMax M2.5 (183 t/s) and MiniMax M2.1 (149 t/s) round out the top three.

Speed: 273 t/s
Intelligence: 49.6
Input: $0.250 /M
Context: 205K

MiniMax models ranked by median (p50) output speed in tokens per second. Faster models suit low-latency and high-throughput applications.

  1. minimax-m2.7
    Reasoning · Tools · JSON · 49.6 intel · $0.250/M · 465ms ttft
    Speed: 273 t/s
  2. minimax-m2.5
    Reasoning · Tools · JSON · 41.9 intel · $0.150/M · 532ms ttft
    Speed: 183 t/s
  3. minimax-m2.1
    Reasoning · Tools · JSON · 39.4 intel · $0.290/M · 769ms ttft
    Speed: 149 t/s
  4. minimax-m2
    Reasoning · Tools · JSON · 36.1 intel · $0.255/M · 340ms ttft
    Speed: 103 t/s
  5. minimax-m3
    Reasoning · Tools · JSON +1 · 54.7 intel · $0.300/M · 689ms ttft
    Speed: 42 t/s
  6. minimax-01
    Vision · $0.200/M · 823ms ttft · 1.0M ctx
    Speed: 34 t/s
  7. minimax-m2-her
    $0.300/M · 915ms ttft · 66K ctx
    Speed: 19 t/s
  8. minimax-m1
    Reasoning · Tools · $0.400/M · 840ms ttft · 1M ctx
    Speed: 18 t/s
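
As a rough illustration of how the ttft and speed figures in the ranking combine, the sketch below estimates the end-to-end time for a response of a given length. It assumes a simplified model (time to first token, then generation at the listed p50 rate), which is not something the ranking itself states. The function names and the 500-token response length are hypothetical, and real latency varies by provider and load.

```python
# Figures copied from the ranking: (ttft in ms, output speed in tokens/s).
MODELS = {
    "minimax-m2.7": (465, 273),
    "minimax-m2.5": (532, 183),
    "minimax-m2.1": (769, 149),
    "minimax-m2": (340, 103),
    "minimax-m3": (689, 42),
    "minimax-01": (823, 34),
    "minimax-m2-her": (915, 19),
    "minimax-m1": (840, 18),
}


def estimated_seconds(ttft_ms: float, tokens_per_second: float, output_tokens: int) -> float:
    """Assumed model: total time = ttft + output_tokens / speed."""
    return ttft_ms / 1000 + output_tokens / tokens_per_second


def rank_by_estimated_time(output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, estimated seconds) pairs, shortest estimate first."""
    estimates = [
        (name, estimated_seconds(ttft, speed, output_tokens))
        for name, (ttft, speed) in MODELS.items()
    ]
    return sorted(estimates, key=lambda pair: pair[1])


if __name__ == "__main__":
    # 500 output tokens is an arbitrary, hypothetical response length.
    for name, seconds in rank_by_estimated_time(500):
        print(f"{name:16s} {seconds:6.2f} s")
```

Under this simplified model, minimax-m2.7 finishes first for a 500-token response, while for outputs of only a few tokens the lower ttft of minimax-m2 can put it ahead.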

Frequently asked

What is the fastest MiniMax model?

The fastest MiniMax model is MiniMax M2.7 at 273 output tokens per second. MiniMax M2.5 (183 t/s) and MiniMax M2.1 (149 t/s) round out the top three.

What's a good alternative to MiniMax M2.7?

MiniMax M2.5 (183 t/s) is the closest alternative on this metric, followed by MiniMax M2.1 (149 t/s). See the full ranking above for the tradeoffs.

How many MiniMax models are there?

modelgrep tracks 8 MiniMax models with live benchmarks, speed, latency, and per-provider pricing. MiniMax M3 leads on intelligence. All 8 qualify for this ranking.
