modelgrep
S

Sao10K: Llama 3 8B Lunaris

sao10k/l3-lunaris-8b

Cheaper than 96% of paidJSON
Use via OpenRouter ↗
Intelligence
Design Elo
Speed
69
112th fastest
Latency
148ms
first token
Input price
$0.040
12th cheapest
Context
8K
16K max out

How it compares

Faster than62%
of all ranked models
Cheaper than96%
of all ranked models

Overview

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge....

Providers & pricing (2)

ProviderIn $/MOut $/MUptime
DeepInfrafp8$0.040$0.050100%
Novitabf16$0.050$0.050100%

Specifications

Context window8K
Max output16K
Knowledge cutoffDec 2023
Input modalitiestext
Output modalitiestext
Prompt caching
Cache read price
ModeratedNo

Llama 3 8B Lunaris FAQ

How much does Llama 3 8B Lunaris cost?

Llama 3 8B Lunaris costs $0.040 per million input tokens and $0.050 per million output tokens via OpenRouter, making it 12th cheapest of 298 paid models.

How fast is Llama 3 8B Lunaris?

Llama 3 8B Lunaris generates around 69 tokens per second with 148ms time-to-first-token (p50), the 112th fastest tracked model.

What is Llama 3 8B Lunaris's context window?

Llama 3 8B Lunaris supports a 8K-token context window and can output up to 16K tokens. It accepts text input.

Compare head-to-head