modelgrep

Fastest MoonshotAI Models

Quick answer · Updated June 2026

The fastest MoonshotAI model is Kimi K2.6 at 162 output tokens per second. Kimi K2 0905 (139 t/s) and Kimi K2 Thinking (103 t/s) round out the top three.

Speed: 162 t/s
Intelligence: 42.9
Input: $0.680/M
Context: 262K

MoonshotAI models ranked by median (p50) output speed, in tokens per second. Faster models suit low-latency and high-throughput applications.

  1. kimi-k2.6
    Reasoning · Tools · JSON · +1
    42.9 intel · $0.680/M · 458ms ttft
    Speed: 162 t/s
  2. kimi-k2-0905
    Tools · JSON
    30.9 intel · $0.600/M · 220ms ttft
    Speed: 139 t/s
  3. kimi-k2-thinking
    Reasoning · Tools · JSON
    24.1 intel · $0.600/M · 412ms ttft
    Speed: 103 t/s
  4. kimi-k2.5
    Reasoning · Tools · JSON · +1
    37.3 intel · $0.375/M · 211ms ttft
    Speed: 89 t/s
  5. kimi-k2.7-code
    Reasoning · Tools · JSON · +1
    $0.750/M · 378ms ttft · 262K ctx
    Speed: 63 t/s
  6. kimi-k2
    Tools
    14.4 intel · $0.570/M · 1.6s ttft
    Speed: 15 t/s
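
The ranking lists time to first token (ttft) and output speed as separate figures. As a rough sketch of how the two might combine into end-to-end response time, the snippet below assumes total time ≈ ttft + output tokens / output speed. That formula is a simplification, not modelgrep's methodology: it ignores network variance, queueing, and any hidden reasoning tokens. The helper names and the example output lengths are hypothetical.

```python
# Figures copied from the ranking above: (ttft in seconds, output tokens per second).
MODELS = {
    "kimi-k2.6": (0.458, 162),
    "kimi-k2-0905": (0.220, 139),
    "kimi-k2-thinking": (0.412, 103),
    "kimi-k2.5": (0.211, 89),
    "kimi-k2.7-code": (0.378, 63),
    "kimi-k2": (1.6, 15),
}


def estimated_seconds(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Hypothetical helper: time to first token plus steady-state generation time."""
    return ttft_s + output_tokens / tokens_per_s


def rank_by_estimate(output_tokens: int) -> list[tuple[str, float]]:
    """Sort models by estimated end-to-end time for a given output length."""
    estimates = (
        (name, estimated_seconds(ttft, tps, output_tokens))
        for name, (ttft, tps) in MODELS.items()
    )
    return sorted(estimates, key=lambda pair: pair[1])


if __name__ == "__main__":
    for tokens in (20, 500):
        print(f"--- {tokens} output tokens ---")
        for name, secs in rank_by_estimate(tokens):
            print(f"{name}: {secs:.2f}s")
```

Under this assumption, the order by estimated time matches the speed ranking for a 500-token response, but for very short outputs a lower ttft can outweigh a higher token rate, so kimi-k2-0905 comes out ahead of kimi-k2.6 at 20 tokens.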

Frequently asked

What is the fastest MoonshotAI model?

The fastest MoonshotAI model is Kimi K2.6 at 162 output tokens per second. Kimi K2 0905 (139 t/s) and Kimi K2 Thinking (103 t/s) round out the top three.

What's a good alternative to Kimi K2.6?

Kimi K2 0905 (139 t/s) is the closest alternative on this metric, followed by Kimi K2 Thinking (103 t/s). See the full ranking above for the tradeoffs.

How many MoonshotAI models are there?

modelgrep tracks 6 MoonshotAI models with live benchmarks, speed, latency, and per-provider pricing. Kimi K2.6 leads on intelligence, and all 6 models qualify for this ranking.
