Mistral: Mistral Nemo

mistralai/mistral-nemo

Cheaper than 99% of paidToolsJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.019

3rd cheapest

Context

131K

16K max out

How it compares

Cheaper than99%

of all ranked models

Overview

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...

Providers & pricing (4)

Provider	In $/M	Out $/M	Context	Uptime
DeepInfrafp8	$0.019	$0.030	131K	99.4%
Novitafp8	$0.040	$0.170	60K	40.3%
Io Netfp16	$0.043	$0.165	128K	98.4%
Mistral	$0.150	$0.150	131K	100%

Specifications

Context window131K

Max output16K

Knowledge cutoffApr 2024

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsmistralai/Mistral-Nemo-Instruct-2407 ↗

Mistral Nemo FAQ

How much does Mistral Nemo cost?

Mistral Nemo costs $0.019 per million input tokens and $0.030 per million output tokens via OpenRouter, making it 3rd cheapest of 332 paid models.

What is Mistral Nemo's context window?

Mistral Nemo supports a 131K-token context window and can output up to 16K tokens. It accepts text input.

More from mistralai

All Mistral Nemo alternatives →

mistralai/mistral-medium-3-5

Intel 29.9$1.50/M

mistralai/mistral-small-2603

Intel —$0.150/M

mistralai/devstral-2512

Intel —$0.400/M

mistralai/ministral-14b-2512

Intel —$0.200/M

mistralai/ministral-8b-2512

Intel —$0.150/M

mistralai/ministral-3b-2512

Intel —$0.100/M

Compare head-to-head

Mistral: Mistral Nemo vs Mistral: Mistral Medium 3.5 Mistral: Mistral Nemo vs Mistral: Mistral Small 4 Mistral: Mistral Nemo vs Mistral: Devstral 2 2512 Mistral: Mistral Nemo vs Mistral: Ministral 3 14B 2512 Mistral: Mistral Nemo vs Mistral: Ministral 3 8B 2512 Mistral: Mistral Nemo vs Mistral: Ministral 3 3B 2512