modelgrep

Nous: Hermes 3 70B Instruct

nousresearch/hermes-3-llama-3.1-70b

Intelligence
Design Elo
Speed: 25 tokens per second (254th fastest)
Latency: 334ms to first token
Input price: $0.700 per million tokens (185th cheapest)
Context: 131K tokens (16K max output)

How it compares

Faster than 14% of all ranked models
Cheaper than 38% of all ranked models

Overview

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

Benchmarks

independent · via OpenRouter
Artificial Analysis
GPQA Diamond: 40%
Humanity's Last Exam: 4%
SciCode: 23%

Providers & pricing (1)

Provider          In $/M   Out $/M   Uptime
DeepInfra (fp8)   $0.700   $0.700    100%

Specifications

Context window: 131K
Max output: 16K
Knowledge cutoff: Dec 2023
Input modalities: text
Output modalities: text
Prompt caching
Cache read price
Moderated: No

Hermes 3 70B Instruct FAQ

How much does Hermes 3 70B Instruct cost?

Hermes 3 70B Instruct costs $0.700 per million input tokens and $0.700 per million output tokens via OpenRouter, making it the 185th cheapest of 298 paid models.
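
As a rough illustration of what these per-million-token prices mean for a single request, the sketch below multiplies token counts by the listed rates. The helper name `estimate_cost_usd` and the example token counts are hypothetical, and the prices are the ones quoted on this page, which may change.

```python
# Prices as quoted on this page, in USD per million tokens (may change).
INPUT_PRICE_PER_M = 0.700
OUTPUT_PRICE_PER_M = 0.700


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative request: a 10,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # prints $0.0084
```

Because input and output are priced identically here, the estimate depends only on the total number of tokens processed.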

How fast is Hermes 3 70B Instruct?

Hermes 3 70B Instruct generates around 25 tokens per second with a 334ms time to first token (p50), making it the 254th fastest tracked model.
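
To put these figures in perspective, a back-of-the-envelope estimate of total response time is the time to first token plus the output length divided by throughput. The sketch below assumes constant throughput at the median figures quoted on this page. Real requests will vary, and `estimate_response_seconds` is a hypothetical helper.

```python
# Median figures as quoted on this page; actual performance varies by request.
TOKENS_PER_SECOND = 25.0
TIME_TO_FIRST_TOKEN_S = 0.334


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough wall-clock estimate for a streamed reply (hypothetical helper).

    Assumes throughput stays constant for the whole generation.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Illustrative case: a 500-token reply takes roughly 20 seconds.
    print(f"{estimate_response_seconds(500):.1f} s")
```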

What is Hermes 3 70B Instruct's context window?

Hermes 3 70B Instruct supports a 131K-token context window and can output up to 16K tokens. It accepts text input.
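
A simple pre-flight check can catch requests that would exceed these limits. The sketch below is a minimal illustration under two assumptions: it treats "131K" and "16K" as 131,000 and 16,000 tokens (the exact limits are not stated on this page), and it assumes output tokens count against the context window. The function name `fits_in_context` is hypothetical.

```python
# Assumed conservative readings of "131K" and "16K"; exact limits may differ.
CONTEXT_WINDOW_TOKENS = 131_000
MAX_OUTPUT_TOKENS = 16_000


def fits_in_context(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Check a request against the listed limits (hypothetical helper).

    Assumes the prompt and the generated output share one context window.
    """
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(fits_in_context(100_000, 16_000))  # within both limits
    print(fits_in_context(120_000, 16_000))  # prompt plus output exceeds the window
```

Token counts depend on the tokenizer, so the prompt length should be measured with the model's own tokenizer rather than estimated from character counts.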
