modelgrep

Fastest AI21 Models

Quick answer · Updated June 2026

The fastest AI21 model is Jamba Large 1.7 at 19 output tokens per second.

Speed: 19 t/s
Intelligence: 10.9
Input: $2.00/M
Context: 256K

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.

  1. jamba-large-1.7
    Tools · JSON · 10.9 intel · $2.00/M · 809ms ttft
    Speed: 19 t/s
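
As a rough illustration of what these figures mean in practice, the sketch below estimates end-to-end response time from the listed time to first token (809 ms) and output speed (19 t/s). The function name `estimated_response_seconds` and the 500-token response length are hypothetical, chosen only for this example. It assumes the p50 speed holds for the whole response and that total time is simply TTFT plus streaming time, which real requests will only approximate.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_ms: float,
                               tokens_per_second: float) -> float:
    """Rough estimate: time to first token plus time to stream the output.

    Assumes a constant output speed, which is a simplification.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


# Figures from the listing: 809 ms TTFT, 19 t/s output speed.
# The 500-token response length is an illustrative assumption.
estimate = estimated_response_seconds(500, ttft_ms=809, tokens_per_second=19)
print(f"{estimate:.1f} s")  # prints "27.1 s"
```

For longer responses the output speed dominates the estimate and TTFT matters little. For short replies of a few dozen tokens, TTFT is a much larger share of the wait.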

Frequently asked

What is the fastest AI21 model?

The fastest AI21 model is Jamba Large 1.7 at 19 output tokens per second.

How many AI21 models are there?

modelgrep tracks 1 AI21 model, Jamba Large 1.7, with live benchmarks, speed, latency and per-provider pricing. It qualifies for this ranking.
