modelgrep

NVIDIA models

11 models tracked · ranked by intelligence, speed & price

Quick answer · Updated June 2026

NVIDIA's smartest model is Nemotron 3 Ultra (free), which scores 47.7 on the Intelligence Index. Its fastest is Nemotron 3 Nano 30B A3B (free) at 180 tokens/sec, and its cheapest paid model is Nemotron 3 Nano 30B A3B at $0.050 per million input tokens. All 11 NVIDIA models are compared below by intelligence, speed, latency, context and price.

Smartest: 47.7 · nvidia/nemotron-3-ultra-550b-a55b:free
Fastest: 180 t/s · nvidia/nemotron-3-nano-30b-a3b:free
Cheapest: $0.050/M · nvidia/nemotron-3-nano-30b-a3b
Model                                         Capabilities                  Intel   In $/M   Context
nemotron-3-ultra-550b-a55b:free               Reasoning, Tools              47.7    Free     1M
nemotron-3-ultra-550b-a55b                    Reasoning, Tools, JSON        47.7    $0.500   1M
nemotron-3-super-120b-a12b:free               Reasoning, Tools, JSON        36.0    Free     1M
nemotron-3-super-120b-a12b                    Reasoning, Tools, JSON        36.0    $0.090   1M
nemotron-3-nano-omni-30b-a3b-reasoning:free   Reasoning, Tools, Vision, +1  21.4    Free     256K
llama-3.3-nemotron-super-49b-v1.5             Reasoning, Tools, JSON        18.7    $0.400   131K
nemotron-nano-12b-v2-vl:free                  Reasoning, Tools, Vision      14.9    Free     128K
nemotron-nano-9b-v2:free                      Reasoning, Tools, JSON        14.8    Free     128K
nemotron-3-nano-30b-a3b:free                  Reasoning, Tools              13.2    Free     256K
nemotron-3-nano-30b-a3b                       Reasoning, Tools, JSON        13.2    $0.050   262K
nemotron-3.5-content-safety:free              Reasoning, Vision             -       Free     128K
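
For readers who want to reproduce the "smartest" and "cheapest" picks from the rows listed above, here is a minimal Python sketch. The `Model` record and the helper names `smartest` and `cheapest_paid` are illustrative assumptions, not part of any modelgrep API. The tie-break rule (prefer the lower price when Intelligence Index scores are equal) and the reading of "cheapest" as "cheapest paid model" are also assumptions, chosen because they match the quick answer. Speed figures are left out because this listing gives only one.

```python
from typing import NamedTuple, Optional


class Model(NamedTuple):
    """Hypothetical record for one row of the listing."""
    name: str
    intel: Optional[float]  # Intelligence Index; None when the page lists no score
    input_price: float      # USD per million input tokens; 0.0 stands for "Free"
    context: str            # context window as printed on the page


# Values copied from the listing above.
MODELS = [
    Model("nemotron-3-ultra-550b-a55b:free", 47.7, 0.0, "1M"),
    Model("nemotron-3-ultra-550b-a55b", 47.7, 0.500, "1M"),
    Model("nemotron-3-super-120b-a12b:free", 36.0, 0.0, "1M"),
    Model("nemotron-3-super-120b-a12b", 36.0, 0.090, "1M"),
    Model("nemotron-3-nano-omni-30b-a3b-reasoning:free", 21.4, 0.0, "256K"),
    Model("llama-3.3-nemotron-super-49b-v1.5", 18.7, 0.400, "131K"),
    Model("nemotron-nano-12b-v2-vl:free", 14.9, 0.0, "128K"),
    Model("nemotron-nano-9b-v2:free", 14.8, 0.0, "128K"),
    Model("nemotron-3-nano-30b-a3b:free", 13.2, 0.0, "256K"),
    Model("nemotron-3-nano-30b-a3b", 13.2, 0.050, "262K"),
    Model("nemotron-3.5-content-safety:free", None, 0.0, "128K"),
]


def smartest(models):
    """Highest Intelligence Index; on a tie, prefer the lower input price (assumed rule)."""
    scored = [m for m in models if m.intel is not None]
    return max(scored, key=lambda m: (m.intel, -m.input_price))


def cheapest_paid(models):
    """Lowest non-zero input price, i.e. the cheapest model that is not free."""
    paid = [m for m in models if m.input_price > 0]
    return min(paid, key=lambda m: m.input_price)


if __name__ == "__main__":
    print("Smartest:", smartest(MODELS).name)
    print("Cheapest paid:", cheapest_paid(MODELS).name)
```

Rows without an Intelligence Index score are skipped when ranking by intelligence rather than treated as zero, so an unscored model cannot be mistaken for the weakest one.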