NVIDIA models

10models tracked · ranked by intelligence, speed & price

Match · Updated July 2026

NVIDIA's smartest model is Nemotron 3 Nano Omni (free) (14.9 on the Intelligence Index), and its cheapest is Nemotron 3 Nano 30B A3B at $0.050 per million input tokens. All 10 NVIDIA models are compared below by intelligence, speed, latency, context and price.

Smartest NVIDIA Fastest NVIDIA Cheapest NVIDIA Small & fast NVIDIA NVIDIA for coding NVIDIA reasoning NVIDIA for design NVIDIA for agents

Smartest

14.9

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

Fastest

—

Cheapest

$0.050/M

nvidia/nemotron-3-nano-30b-a3b

Model	Intel	Speed	Latency	In $/M	Context
nemotron-3-nano-omni-30b-a3b-reasoning:free ReasoningToolsVision+1	14.9	—	—	Free	256K
nemotron-3-nano-30b-a3b ReasoningToolsJSON	7.4	—	—	$0.050	262K
nemotron-3.5-content-safety:free ReasoningVision	—	—	—	Free	128K
nemotron-3-ultra-550b-a55b ReasoningToolsJSON	—	—	—	$0.600	512K
nemotron-3-ultra-550b-a55b:free ReasoningTools	—	—	—	Free	1M
nemotron-3-super-120b-a12b ReasoningToolsJSON	—	—	—	$0.085	1M
nemotron-3-super-120b-a12b:free ReasoningToolsJSON	—	—	—	Free	262K
nemotron-3-nano-30b-a3b:free ReasoningTools	—	—	—	Free	256K
nemotron-nano-12b-v2-vl:free ReasoningToolsVision	—	—	—	Free	128K
nemotron-nano-9b-v2:free ReasoningToolsJSON	—	—	—	Free	128K