Fastest Deep Cogito Models

Quick answer · Updated June 2026

The fastest Deep Cogito model is Cogito v2.1 671B at 27 output tokens per second.

27 t/sSpeed

$1.25Input /M

128KContext

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.

1D
cogito-v2.1-671b
ReasoningJSON$1.25/M · 325ms ttft · 128K ctx
27 t/s
Speed

Frequently asked

What is the fastest Deep Cogito model?

The fastest Deep Cogito model is Cogito v2.1 671B at 27 output tokens per second.

How many Deep Cogito models are there?

modelgrep tracks 1 Deep Cogito models with live benchmarks, speed, latency and per-provider pricing. 1 of them qualify for this ranking.

More Deep Cogito rankings

Deep Cogito: Smartest LLMs Deep Cogito: Best LLMs for Coding Deep Cogito: Best LLMs for Design & Frontend Deep Cogito: Lowest-Latency LLMs Deep Cogito: Cheapest LLMs Deep Cogito: Best Free LLMs Deep Cogito: Best Reasoning LLMs Deep Cogito: Best Vision LLMs Deep Cogito: Best LLMs for Agents Deep Cogito: Best Open-Source LLMs Deep Cogito: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Lowest-Latency LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs