Fastest AI21 Models

Quick answer · Updated June 2026

The fastest AI21 model is Jamba Large 1.7 at 19 output tokens per second.

Speed: 19 t/s
Intelligence: 10.9
Input: $2.00/M
Context: 256K

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.
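As a rough illustration of what a p50 (median) output speed means, the sketch below takes a handful of per-request speed measurements and returns the middle value. The function name `p50_tokens_per_second` and the sample numbers are hypothetical, chosen only for illustration; how modelgrep actually collects and aggregates its measurements is not described here.

```python
from statistics import median


def p50_tokens_per_second(samples):
    """Return the median (p50) of per-request output speeds, in tokens/second."""
    if not samples:
        raise ValueError("need at least one speed sample")
    return median(samples)


# Hypothetical measurements from five requests (not real benchmark data).
speeds = [17.5, 19.0, 21.2, 18.4, 30.1]
print(p50_tokens_per_second(speeds))  # prints 19.0
```

The median is less sensitive than the mean to a single unusually fast or slow request, which is one plausible reason to rank on p50.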

1. jamba-large-1.7
Tools · JSON · 10.9 intel · $2.00/M · 809ms ttft
Speed: 19 t/s
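To put the figures listed for jamba-large-1.7 in practical terms, here is a minimal sketch that estimates how long a streamed response might take. It assumes total time is simply time to first token plus output tokens divided by output speed; real latency also depends on prompt length, provider load, and network conditions. The function name `estimate_response_seconds` is hypothetical.

```python
def estimate_response_seconds(output_tokens, ttft_ms=809.0, tokens_per_second=19.0):
    """Estimate wall-clock seconds for a response.

    Simplifying assumption: time = time-to-first-token + tokens / speed.
    Defaults are the ttft and speed figures listed for jamba-large-1.7.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


# A 500-token answer: 0.809 s + 500 / 19 s
print(f"{estimate_response_seconds(500):.1f} s")  # prints 27.1 s
```

At 19 t/s, generation time dominates for anything beyond a short reply, so output length matters far more than the 809 ms time to first token.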

Frequently asked

What is the fastest AI21 model?

The fastest AI21 model is Jamba Large 1.7 at 19 output tokens per second.

How many AI21 models are there?

modelgrep tracks 1 AI21 model, Jamba Large 1.7, with live benchmarks, speed, latency and per-provider pricing. It qualifies for this ranking.
