Cheapest NVIDIA Models

Match · Updated July 2026

The cheapest NVIDIA model is Nemotron 3 Nano 30B A3B at $0.050 per million input tokens. Nemotron 3 Super ($0.085) and Nemotron 3 Ultra ($0.600) round out the top three.

$0.050Input /M

7.4Intelligence

262KContext

AI models ranked by input token price. The cheapest LLM APIs and most affordable AI models, from budget open-weight models to discounted frontier models.

Frequently asked

What is the cheapest NVIDIA model?

The cheapest NVIDIA model is Nemotron 3 Nano 30B A3B at $0.050 per million input tokens. Nemotron 3 Super ($0.085) and Nemotron 3 Ultra ($0.600) round out the top three.

What's a good alternative to Nemotron 3 Nano 30B A3B?

Nemotron 3 Super ($0.085) is the closest alternative on this metric, followed by Nemotron 3 Ultra ($0.600). See the full ranking above for the tradeoffs.

How many NVIDIA models are there?

modelgrep tracks 10 NVIDIA models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Nemotron 3 Nano Omni (free). 3 of them qualify for this ranking.

More NVIDIA rankings

NVIDIA: Smartest LLMs NVIDIA: Best LLMs for Coding NVIDIA: Best LLMs for Design & Frontend NVIDIA: Fastest LLMs NVIDIA: Lowest-Latency LLMs NVIDIA: Best Free LLMs NVIDIA: Best Reasoning LLMs NVIDIA: Best Vision LLMs NVIDIA: Best LLMs for Agents NVIDIA: Best Open-Source LLMs NVIDIA: Longest-Context LLMs NVIDIA: Best LLMs for Writing NVIDIA: Best LLMs for Math & Science NVIDIA: Best LLMs for RAG NVIDIA: Best LLMs for SQL & Data Analysis NVIDIA: Best LLMs for Roleplay NVIDIA: Best Uncensored LLMs

All rankings

Small & Fast LLMs Best Local LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Fastest LLMs Lowest-Latency LLMs Best Free LLMs Best Reasoning LLMs Best Vision LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs Best LLMs for Writing Best LLMs for Math & Science Best LLMs for RAG Best LLMs for SQL & Data Analysis Best LLMs for Roleplay Best Uncensored LLMs