Nemotron 3 Ultra (free) has the largest context window of any NVIDIA model, at 1M tokens. Nemotron 3 Ultra (1M) and Nemotron 3 Super (free) (1M) round out the top three.
AI models with the largest context windows, ranked by token capacity. The best large language models for long documents, codebases and extended conversations.
Nemotron 3 Ultra (free) has the largest context window of any NVIDIA model, at 1M tokens. Nemotron 3 Ultra (1M) and Nemotron 3 Super (free) (1M) round out the top three.
Nemotron 3 Ultra (1M) is the closest alternative on this metric, followed by Nemotron 3 Super (free) (1M). See the full ranking above for the tradeoffs.
modelgrep tracks 11 NVIDIA models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Nemotron 3 Ultra (free). 7 of them qualify for this ranking.