Fastest DeepSeek Models

Quick answer · Updated June 2026

The fastest DeepSeek model is DeepSeek V3.1 at 92 output tokens per second. DeepSeek V4 Flash (73 t/s) and R1 (73 t/s) round out the top three.

92 t/sSpeed

28.1Intelligence

$0.210Input /M

164KContext

AI models ranked by output speed (tokens per second, p50). The fastest large language models for low-latency and high-throughput applications.

1D
deepseek-chat-v3.1
ReasoningToolsJSON28.1 intel · $0.210/M · 330ms ttft
92 t/s
Speed
2D
deepseek-v4-flash
ReasoningToolsJSON46.0 intel · $0.090/M · 537ms ttft
73 t/s
Speed
3D
deepseek-r1
ReasoningToolsJSON18.8 intel · $0.700/M · 1.4s ttft
73 t/s
Speed
4D
deepseek-v3.2
ReasoningToolsJSON41.7 intel · $0.229/M · 534ms ttft
59 t/s
Speed
5D
deepseek-v4-pro
ReasoningToolsJSON39.3 intel · $0.435/M · 664ms ttft
58 t/s
Speed
6D
deepseek-chat-v3-0324
ToolsJSON22.3 intel · $0.200/M · 846ms ttft
36 t/s
Speed
7D
deepseek-r1-distill-llama-70b
Reasoning$0.800/M · 761ms ttft · 128K ctx
31 t/s
Speed
8D
deepseek-r1-0528
ReasoningToolsJSON27.1 intel · $0.500/M · 695ms ttft
30 t/s
Speed
9D
deepseek-v3.1-terminus
ReasoningToolsJSON28.5 intel · $0.270/M · 881ms ttft
27 t/s
Speed
10D
deepseek-r1-distill-qwen-32b
ReasoningJSON$0.290/M · 859ms ttft · 128K ctx
23 t/s
Speed
11D
deepseek-chat
ToolsJSON$0.200/M · 560ms ttft · 131K ctx
22 t/s
Speed
12D
deepseek-v3.2-exp
ReasoningToolsJSON32.1 intel · $0.270/M · 1.5s ttft
19 t/s
Speed

Frequently asked

What is the fastest DeepSeek model?

The fastest DeepSeek model is DeepSeek V3.1 at 92 output tokens per second. DeepSeek V4 Flash (73 t/s) and R1 (73 t/s) round out the top three.

What's a good alternative to DeepSeek V3.1?

DeepSeek V4 Flash (73 t/s) is the closest alternative on this metric, followed by R1 (73 t/s). See the full ranking above for the tradeoffs.

How many DeepSeek models are there?

modelgrep tracks 12 DeepSeek models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by DeepSeek V4 Flash. 12 of them qualify for this ranking.

More DeepSeek rankings

DeepSeek: Smartest LLMs DeepSeek: Best LLMs for Coding DeepSeek: Best LLMs for Design & Frontend DeepSeek: Lowest-Latency LLMs DeepSeek: Cheapest LLMs DeepSeek: Best Free LLMs DeepSeek: Best Reasoning LLMs DeepSeek: Best Vision LLMs DeepSeek: Best LLMs for Agents DeepSeek: Best Open-Source LLMs DeepSeek: Longest-Context LLMs

All rankings

Small & Fast LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Lowest-Latency LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs