Nous: Hermes 4 70B

nousresearch/hermes-4-70b

Cheaper than 80% of paidReasoningJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.130

67th cheapest

Context

131K

How it compares

Cheaper than80%

of all ranked models

Overview

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Nebiusfp8	$0.130	$0.400	131K	100%

Specifications

Context window131K

Max output—

Knowledge cutoffAug 2024

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsNousResearch/Hermes-4-70B ↗

Hermes 4 70B FAQ

How much does Hermes 4 70B cost?

Hermes 4 70B costs $0.130 per million input tokens and $0.400 per million output tokens via OpenRouter, making it 67th cheapest of 332 paid models.

What is Hermes 4 70B's context window?

Hermes 4 70B supports a 131K-token context window. It accepts text input.

More from nousresearch

All Hermes 4 70B alternatives →

nousresearch/hermes-4-405b

Intel —$1.00/M

nousresearch/hermes-3-llama-3.1-70b

Intel 5.1$0.700/M

nousresearch/hermes-3-llama-3.1-405b

Intel —$1.00/M

Compare head-to-head

Nous: Hermes 4 70B vs Nous: Hermes 4 405B Nous: Hermes 4 70B vs Nous: Hermes 3 70B Instruct Nous: Hermes 4 70B vs Nous: Hermes 3 405B Instruct