nousresearch/hermes-3-llama-3.1-70b
Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| DeepInfrafp8 | $0.700 | $0.700 | 131K | 100% |
Hermes 3 70B Instruct costs $0.700 per million input tokens and $0.700 per million output tokens via OpenRouter, making it 185th cheapest of 298 paid models.
Hermes 3 70B Instruct generates around 25 tokens per second with 334ms time-to-first-token (p50), the 254th fastest tracked model.
Hermes 3 70B Instruct supports a 131K-token context window and can output up to 16K tokens. It accepts text input.