Meta: Llama 3.1 8B Instruct

meta-llama/llama-3.1-8b-instruct

Cheaper than 93% of paidToolsJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.050

22nd cheapest

Context

131K

131K max out

How it compares

Cheaper than93%

of all ranked models

Overview

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

Providers & pricing (5)

Provider	In $/M	Out $/M	Context	Uptime
DeepInfrafp8	$0.020	$0.040	131K	99.9%
Novitafp8	$0.020	$0.050	16K	99.8%
Groq	$0.050	$0.080	131K	99.8%
Cloudflarefp8	$0.152	$0.287	32K	99.8%
CoreWeavebf16	$0.220	$0.220	128K	100%

Specifications

Context window131K

Max output131K

Knowledge cutoffDec 2023

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price$0.025/M

ModeratedNo

Open weightsmeta-llama/Meta-Llama-3.1-8B-Instruct ↗

Llama 3.1 8B Instruct FAQ

How much does Llama 3.1 8B Instruct cost?

Llama 3.1 8B Instruct costs $0.050 per million input tokens and $0.080 per million output tokens via OpenRouter, making it 22nd cheapest of 332 paid models.