Meta: Llama 3.2 3B Instruct

meta-llama/llama-3.2-3b-instruct

Cheaper than 94% of paidJSON

Use via OpenRouter ↗

Intelligence

—

Design Elo

—

Speed

—

tokens/sec

Latency

—

first token

Input price

$0.050

21st cheapest

Context

131K

131K max out

How it compares

Cheaper than94%

of all ranked models

Overview

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

Providers & pricing (2)

Provider	In $/M	Out $/M	Context	Uptime
Parasailbf16	$0.050	$0.330	131K	100%
Cloudflare	$0.051	$0.335	80K	100%

Specifications

Context window131K

Max output131K

Knowledge cutoffDec 2023

Input modalitiestext

Output modalitiestext

Prompt caching—

Cache read price—

ModeratedNo

Open weightsmeta-llama/Llama-3.2-3B-Instruct ↗

Llama 3.2 3B Instruct FAQ

How much does Llama 3.2 3B Instruct cost?

Llama 3.2 3B Instruct costs $0.050 per million input tokens and $0.330 per million output tokens via OpenRouter, making it 21st cheapest of 332 paid models.