The fastest AI21 model is Jamba Large 1.7 at 19 output tokens per second.
AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.
modelgrep tracks 1 AI21 model with live benchmarks, speed, latency, and per-provider pricing, led on intelligence by Jamba Large 1.7. That model qualifies for this ranking.