Meta: Llama 3.3 70B Instruct

meta-llama/llama-3.3-70b-instruct

Cheaper than 79% of paid models · Tools · JSON


Intelligence: —

Design Elo: —

Speed (tokens/sec): —

Latency (first token): —

Input price: $0.130 per million tokens (69th cheapest)

Context: 131K (128K max output)

How it compares

Cheaper than 79% of all ranked models

Overview

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

Providers & pricing (13)

Provider	Quantization	In $/M	Out $/M	Context	Uptime
DeepInfra	fp8	$0.100	$0.320	131K	94.5%
Nebius	fp8	$0.130	$0.400	131K	56.3%
AkashML	fp8	$0.130	$0.400	131K	99.5%
Novita	bf16	$0.135	$0.400	6K	97.3%
Parasail	fp8	$0.220	$0.500	131K	94.6%
Crusoe	bf16	$0.250	$0.750	131K	100%
Cloudflare	fp8	$0.293	$2.25	24K	97.5%
SambaNova	bf16	$0.450	$0.900	16K	89.5%
Groq	—	$0.590	$0.790	131K	100%
CoreWeave	fp16	$0.710	$0.710	128K	100%
Google	—	$0.720	$0.720	128K	98.4%
Google	—	$0.720	$0.720	128K	100%
Together	fp8	$1.04	$1.04	131K	99.6%
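
Because providers differ in price, context window, and uptime, the cheapest row is not always the right one for a given workload. The following is a minimal sketch of one way to filter and rank the rows above. The `Provider` class, `request_cost`, and `cheapest` are hypothetical names introduced for illustration, and the figures are copied from the table as listed, so they may be out of date.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Provider:
    name: str
    in_per_m: float   # USD per million input tokens
    out_per_m: float  # USD per million output tokens
    context_k: int    # context window, in thousands of tokens
    uptime: float     # uptime, in percent


# Rows copied from the provider table (illustrative snapshot, not live data).
PROVIDERS = [
    Provider("DeepInfra", 0.100, 0.320, 131, 94.5),
    Provider("Nebius", 0.130, 0.400, 131, 56.3),
    Provider("AkashML", 0.130, 0.400, 131, 99.5),
    Provider("Novita", 0.135, 0.400, 6, 97.3),
    Provider("Parasail", 0.220, 0.500, 131, 94.6),
    Provider("Crusoe", 0.250, 0.750, 131, 100.0),
    Provider("Cloudflare", 0.293, 2.25, 24, 97.5),
    Provider("SambaNova", 0.450, 0.900, 16, 89.5),
    Provider("Groq", 0.590, 0.790, 131, 100.0),
    Provider("CoreWeave", 0.710, 0.710, 128, 100.0),
    Provider("Google", 0.720, 0.720, 128, 98.4),
    Provider("Google", 0.720, 0.720, 128, 100.0),
    Provider("Together", 1.04, 1.04, 131, 99.6),
]


def request_cost(p: Provider, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at the provider's listed per-million rates."""
    return (input_tokens * p.in_per_m + output_tokens * p.out_per_m) / 1_000_000


def cheapest(input_tokens: int, output_tokens: int,
             min_context_k: int = 0, min_uptime: float = 0.0) -> Optional[Provider]:
    """Return the lowest-cost provider meeting the context and uptime floors."""
    eligible = [p for p in PROVIDERS
                if p.context_k >= min_context_k and p.uptime >= min_uptime]
    if not eligible:
        return None
    return min(eligible, key=lambda p: request_cost(p, input_tokens, output_tokens))
```

For example, `cheapest(10_000, 1_000)` selects the lowest-priced row overall, while adding `min_uptime=99.0` excludes providers whose listed uptime falls below 99%.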

Specifications

Context window: 131K

Max output: 128K

Knowledge cutoff: Dec 2023

Input modalities: text

Output modalities: text

Prompt caching: —

Cache read price: —

Moderated: No

Open weights: meta-llama/Llama-3.3-70B-Instruct

Llama 3.3 70B Instruct FAQ

How much does Llama 3.3 70B Instruct cost?

Llama 3.3 70B Instruct costs $0.130 per million input tokens and $0.400 per million output tokens via OpenRouter, making it the 69th cheapest of 332 paid models.
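
As a rough illustration of what those rates mean per request, the sketch below converts token counts into dollars and checks the ranking figure. The function names are hypothetical, and the "cheaper than" formula is an assumption about how the percentage is derived (the share of paid models ranked below this one), not something the listing states.

```python
# Listed OpenRouter rates, in USD per million tokens.
INPUT_PRICE_PER_M = 0.130
OUTPUT_PRICE_PER_M = 0.400


def cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD of a single request at the listed rates."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


def cheaper_than_pct(rank: int, total: int) -> float:
    """Assumed formula: percentage of models that rank as more expensive."""
    return 100 * (total - rank) / total


# A request with 2,000 input tokens and 500 output tokens.
print(f"${cost_usd(2_000, 500):.5f}")

# Rank 69 of 332 paid models, rounded to a whole percent.
print(f"{cheaper_than_pct(69, 332):.0f}%")
```

Under that assumption, rank 69 of 332 works out to roughly 79%, which matches the figure shown in the page summary.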