nvidia/nemotron-3-nano-30b-a3b
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| DeepInfrafp4 | $0.050 | $0.200 | 262K | 94.5% |
| Nebiusfp8 | $0.060 | $0.240 | 262K | 84.5% |
Nemotron 3 Nano 30B A3B costs $0.050 per million input tokens and $0.200 per million output tokens via OpenRouter, making it 16th cheapest of 298 paid models.
Nemotron 3 Nano 30B A3B scores 13.2 on the Artificial Analysis Intelligence Index, ranking 154th of 178 benchmarked models, with a GPQA Diamond score of 40%.
Nemotron 3 Nano 30B A3B generates around 176 tokens per second with 508ms time-to-first-token (p50), the 20th fastest tracked model.
Nemotron 3 Nano 30B A3B supports a 262K-token context window and can output up to 228K tokens. It accepts text input.