Nous: Hermes 4 405B

nousresearch/hermes-4-405b

ReasoningJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$1.00

228th cheapest

Context

131K

How it compares

Cheaper than31%

of all ranked models

Overview

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

Providers & pricing (1)

Provider	In $/M	Out $/M	Context	Uptime
Nebiusfp8	$1.00	$3.00	131K	100%

Specifications

Context window131K

Max output—

Knowledge cutoffAug 2024

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsNousResearch/Hermes-4-405B ↗

Hermes 4 405B FAQ

How much does Hermes 4 405B cost?

Hermes 4 405B costs $1.00 per million input tokens and $3.00 per million output tokens via OpenRouter, making it 228th cheapest of 332 paid models.

What is Hermes 4 405B's context window?

Hermes 4 405B supports a 131K-token context window. It accepts text input.

More from nousresearch

All Hermes 4 405B alternatives →

nousresearch/hermes-4-70b

Intel —$0.130/M

nousresearch/hermes-3-llama-3.1-70b

Intel 5.1$0.700/M

nousresearch/hermes-3-llama-3.1-405b

Intel —$1.00/M

Compare head-to-head

Nous: Hermes 4 405B vs Nous: Hermes 4 70B Nous: Hermes 4 405B vs Nous: Hermes 3 70B Instruct Nous: Hermes 4 405B vs Nous: Hermes 3 405B Instruct