modelgrep
M

Meta: Llama 3 8B Instruct

meta-llama/llama-3-8b-instruct

176th smartest of 178Cheaper than 75% of paid
Use via OpenRouter ↗
Intelligence
6.4
176th of 178
Design Elo
Speed
63
131st fastest
Latency
660ms
first token
Input price
$0.140
75th cheapest
Context
8K

How it compares

Smarter than1%
of all ranked models
Faster than56%
of all ranked models
Cheaper than75%
of all ranked models

Overview

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...

Benchmarks

independent · via OpenRouter
Artificial Analysis2th percentile
Intelligence Index
6.4
Coding Index
4.0
Agentic Index
0.0
GPQA Diamond
30%
Humanity's Last Exam
5%
SciCode
12%
Tau²-Bench (agentic)
0%

Providers & pricing (1)

ProviderIn $/MOut $/MUptime
Togetherint4$0.140$0.140100%

Specifications

Context window8K
Max output
Knowledge cutoffDec 2023
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price
ModeratedNo

Llama 3 8B Instruct FAQ

How much does Llama 3 8B Instruct cost?

Llama 3 8B Instruct costs $0.140 per million input tokens and $0.140 per million output tokens via OpenRouter, making it 75th cheapest of 298 paid models.

How smart is Llama 3 8B Instruct?

Llama 3 8B Instruct scores 6.4 on the Artificial Analysis Intelligence Index, ranking 176th of 178 benchmarked models, with a GPQA Diamond score of 30%.

How fast is Llama 3 8B Instruct?

Llama 3 8B Instruct generates around 63 tokens per second with 660ms time-to-first-token (p50), the 131st fastest tracked model.

What is Llama 3 8B Instruct's context window?

Llama 3 8B Instruct supports a 8K-token context window. It accepts text input.

Compare head-to-head