The fastest Deep Cogito model is Cogito v2.1 671B at 27 output tokens per second.
AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.
The fastest Deep Cogito model is Cogito v2.1 671B at 27 output tokens per second.
modelgrep tracks 1 Deep Cogito models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.