modelgrep

Nous: Hermes 4 405B vs NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

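NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 wins on more metrics (4 of 7): it is faster, has lower latency, and is cheaper on both input and output tokens. Nous: Hermes 4 405B leads on the three quality benchmarks (Intelligence Index, Coding Index, and GPQA Diamond). The right pick depends on what you optimize for; see the breakdown below.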

| Metric | Nous: Hermes 4 405B | NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 |
| --- | --- | --- |
| Intelligence Index | 18.6 | 14.6 |
| Coding Index | 16.0 | 10.5 |
| GPQA Diamond | 73% | 48% |
| Design Arena Elo | | |
| Speed (tokens/sec) | 35 | 44 |
| Latency | 345ms | 246ms |
| Input price /M | $1.00 | $0.400 |
| Output price /M | $3.00 | $0.400 |
| Context window | 131K | 131K |
| Capabilities | Reasoning, JSON | Reasoning, Tools, JSON |
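
The "4 of 7" tally in the summary can be reproduced from the figures in the comparison table. The sketch below is one plausible reading, not the site's actual scoring method: it assumes that higher is better for the benchmark scores and speed, that lower is better for latency and price, that the tied context window is not counted as a win for either model, and that Design Arena Elo is skipped because no values are listed. The `METRICS` dictionary and the `tally` function are hypothetical names introduced for illustration.

```python
# Values copied from the comparison table, in the order
# (Hermes 4 405B, Nemotron Super 49B V1.5, higher_is_better).
# The higher_is_better flags are an assumption; the page does not state them.
METRICS = {
    "Intelligence Index": (18.6, 14.6, True),
    "Coding Index": (16.0, 10.5, True),
    "GPQA Diamond (%)": (73, 48, True),
    "Speed (tokens/sec)": (35, 44, True),
    "Latency (ms)": (345, 246, False),
    "Input price ($/M tokens)": (1.00, 0.40, False),
    "Output price ($/M tokens)": (3.00, 0.40, False),
    "Context window (K tokens)": (131, 131, True),
}


def tally(metrics):
    """Group metric names by which model has the better value."""
    wins = {"hermes": [], "nemotron": [], "tie": []}
    for name, (hermes, nemotron, higher_is_better) in metrics.items():
        if hermes == nemotron:
            wins["tie"].append(name)
        elif (hermes > nemotron) == higher_is_better:
            wins["hermes"].append(name)
        else:
            wins["nemotron"].append(name)
    return wins


result = tally(METRICS)
for model, names in result.items():
    print(f"{model}: {len(names)} -> {', '.join(names)}")
```

Under these assumptions, Nemotron takes the four speed and cost metrics, Hermes takes the three quality benchmarks, and the context window is a tie, which matches the 4 of 7 figure when only the decided metrics are counted.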