modelgrep

DeepSeek: R1 vs Nous: Hermes 4 405B

DeepSeek: R1 wins on more metrics (5 of 8), but the right pick depends on what you optimize for — see the breakdown below.

MetricDeepSeek: R1Nous: Hermes 4 405B
Intelligence Index18.818.6
Coding Index15.916.0
GPQA Diamond71%73%
Design Arena Elo
Speed (tokens/sec)7231
Latency1.4s353ms
Input price /M$0.700$1.00
Output price /M$2.50$3.00
Context window164K131K
CapabilitiesReasoningToolsJSONReasoningJSON