nousresearch/hermes-4-405b
Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Nebius (fp8) | $1.00 | $3.00 | 131K | 100% |
Hermes 4 405B costs $1.00 per million input tokens and $3.00 per million output tokens via OpenRouter, making it the 207th cheapest of 298 paid models.
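To make the per-token pricing concrete, here is a minimal sketch of the arithmetic for a single request. The two rates are the ones listed above; the helper name `estimate_cost_usd` is hypothetical, and the sketch assumes billing is strictly linear in token counts with no minimums, caching discounts, or fees.

```python
from decimal import Decimal

# Rates copied from the listing above, in USD per million tokens.
INPUT_USD_PER_M = Decimal("1.00")
OUTPUT_USD_PER_M = Decimal("3.00")


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: linear cost estimate for one request.

    Assumes cost scales linearly with token counts; real invoices
    may differ (this is an illustration, not a billing API).
    """
    million = Decimal(1_000_000)
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / million


if __name__ == "__main__":
    # A 2,000-token prompt with a 500-token completion.
    print(estimate_cost_usd(2_000, 500))
```

Under these assumptions, a 2,000-token prompt with a 500-token completion comes to $0.0035. If the internal deliberation is billed as output tokens (not stated in the listing), it would need to be included in `output_tokens`.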
Hermes 4 405B scores 18.6 on the Artificial Analysis Intelligence Index, ranking 128th of 178 benchmarked models, with a GPQA Diamond score of 73%.
Hermes 4 405B generates around 32 tokens per second with 465ms time-to-first-token (p50), the 230th fastest tracked model.
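As a rough illustration of what these two figures imply for end-to-end response time, the sketch below adds the time-to-first-token to the time needed to stream the remaining output. The function name `estimate_latency_seconds` is hypothetical, and the model is deliberately simple: it treats the p50 figures above as constants and assumes steady throughput, which real requests will not match exactly.

```python
# Median figures copied from the listing above; both are approximate.
TTFT_SECONDS = 0.465        # 465 ms time-to-first-token (p50)
TOKENS_PER_SECOND = 32.0    # "around 32 tokens per second"


def estimate_latency_seconds(output_tokens: int) -> float:
    """Hypothetical helper: time to first token plus steady-state generation.

    Assumes constant throughput and ignores network and queueing variance.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Estimated wall-clock time for a 320-token completion.
    print(f"{estimate_latency_seconds(320):.3f} s")
```

With these numbers, a 320-token completion would take roughly 10.5 seconds, so generation speed rather than time-to-first-token dominates for anything beyond very short replies.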
Hermes 4 405B supports a 131K-token context window. It accepts text input.