modelgrep

DeepSeek: DeepSeek V3.1 Terminus vs xAI: Grok 4.20

xAI: Grok 4.20 wins on more metrics (5 of 9), but the right pick depends on what you optimize for — see the breakdown below.

MetricDeepSeek: DeepSeek V3.1 TerminusxAI: Grok 4.20
Intelligence Index28.529.7
Coding Index31.925.4
GPQA Diamond75%79%
Design Arena Elo1238
Speed (tokens/sec)2876
Latency848ms707ms
Input price /M$0.270$1.25
Output price /M$0.950$2.50
Context window164K2M
CapabilitiesReasoningToolsJSONReasoningToolsJSONVision