nousresearch/hermes-3-llama-3.1-405b
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| DeepInfrafp8 | $1.00 | $1.00 | 131K | 100% |
Hermes 3 405B Instruct costs $1.00 per million input tokens and $1.00 per million output tokens via OpenRouter, making it 209th cheapest of 298 paid models.
Hermes 3 405B Instruct scores 17.6 on the Artificial Analysis Intelligence Index, ranking 133rd of 178 benchmarked models, with a GPQA Diamond score of 54%.
Hermes 3 405B Instruct generates around 21 tokens per second with 391ms time-to-first-token (p50), the 261st fastest tracked model.
Hermes 3 405B Instruct supports a 131K-token context window and can output up to 16K tokens. It accepts text input.