nvidia/nemotron-3-super-120b-a12b
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
| Provider | Quantization | Input $/M | Output $/M | Context | Uptime |
|---|---|---|---|---|---|
| DekaLLM | fp8 | $0.090 | $0.450 | 262K | 99.1% |
| DeepInfra | bf16 | $0.100 | $0.500 | 262K | 98.2% |
| DigitalOcean | — | $0.300 | $0.650 | 1M | 99.8% |
| Nebius | fp4 | $0.300 | $0.900 | 262K | — |
Nemotron 3 Super costs as little as $0.090 per million input tokens and $0.450 per million output tokens via OpenRouter (the lowest-priced provider listed), making it the 42nd cheapest of 298 paid models.
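To make the per-million pricing concrete, here is a minimal sketch of the per-request cost arithmetic. The function name `request_cost` and the example token counts are illustrative assumptions. The default prices are the lowest rates quoted above, and actual billing may differ by provider.

```python
# Lowest listed rates, in USD per million tokens (taken from the listing above).
PRICE_IN_PER_M = 0.090
PRICE_OUT_PER_M = 0.450


def request_cost(input_tokens: int, output_tokens: int,
                 price_in: float = PRICE_IN_PER_M,
                 price_out: float = PRICE_OUT_PER_M) -> float:
    """Return the estimated cost in USD of a single request.

    Hypothetical helper: assumes simple linear per-token pricing with
    no minimum charge, caching discount, or provider surcharge.
    """
    return (input_tokens / 1_000_000) * price_in \
        + (output_tokens / 1_000_000) * price_out


if __name__ == "__main__":
    # Example: a 20,000-token prompt producing a 2,000-token answer.
    cost = request_cost(20_000, 2_000)
    print(f"${cost:.4f}")  # prints $0.0027
```

Passing a different provider's rates through `price_in` and `price_out` gives a like-for-like comparison for the same workload.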
Nemotron 3 Super scores 36.0 on the Artificial Analysis Intelligence Index, ranking 58th of 180 benchmarked models, with a GPQA Diamond score of 80%.
Nemotron 3 Super generates around 240 tokens per second with a 1.2s time-to-first-token (p50), making it the 15th fastest tracked model.
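As a rough illustration of what these figures imply for end-to-end response time, the sketch below adds time-to-first-token to generation time at a constant rate. The helper name `estimated_latency` is an assumption, and the model is deliberately simple: it treats the quoted p50 values as fixed and ignores network variance, queueing, and prompt-length effects.

```python
def estimated_latency(output_tokens: int,
                      ttft_s: float = 1.2,
                      tokens_per_s: float = 240.0) -> float:
    """Return an approximate end-to-end response time in seconds.

    Hypothetical helper: time-to-first-token plus steady-state
    generation time, using the p50 figures quoted above as defaults.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


if __name__ == "__main__":
    # Example: a 1,200-token completion.
    print(f"{estimated_latency(1200):.1f} s")  # prints 6.2 s
```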
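Nemotron 3 Super supports a context window of up to 1M tokens, although among the providers listed above only DigitalOcean offers 1M and the others offer 262K. It accepts text input.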