meta-llama/llama-3.1-70b-instruct
Meta's latest class of models (Llama 3.1) launched in a variety of sizes and flavors. This 70B instruct-tuned version is optimized for high-quality dialogue use cases. It has demonstrated strong...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| DeepInfra (fp8) | $0.400 | $0.400 | 131K | 99.6% |
| Amazon Bedrock | $0.720 | $0.720 | 131K | 99.7% |
| WandB (bf16) | $0.800 | $0.800 | 128K | 100% |
Llama 3.1 70B Instruct costs $0.400 per million input tokens and $0.400 per million output tokens via OpenRouter, making it the 153rd cheapest of 298 paid models.
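To make the per-million pricing concrete, here is a minimal sketch of the cost arithmetic for a single request. The prices are copied from the provider table above; the `PRICES_PER_MILLION` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration and are not part of any provider's API.

```python
# Per-million-token prices in USD as (input, output), copied from the table above.
# The dictionary and function names are illustrative assumptions.
PRICES_PER_MILLION = {
    "DeepInfra": (0.400, 0.400),
    "Amazon Bedrock": (0.720, 0.720),
    "WandB": (0.800, 0.800),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request with the given token counts."""
    in_price, out_price = PRICES_PER_MILLION[provider]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # Compare the listed providers for a 10,000-token prompt and a 2,000-token reply.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${estimate_cost(name, 10_000, 2_000):.4f}")
```

At the lowest listed rate, a 10,000-token prompt with a 2,000-token reply comes to roughly $0.0048, since input and output are billed at the same price per token.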
Llama 3.1 70B Instruct scores 12.5 on the Artificial Analysis Intelligence Index, ranking 157th of 178 benchmarked models, with a GPQA Diamond score of 41%.
Llama 3.1 70B Instruct generates around 28 tokens per second with 303ms time-to-first-token (p50), the 245th fastest tracked model.
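As a rough illustration of what these figures mean for response time, the sketch below adds the time-to-first-token to the time needed to stream the remaining output at the quoted throughput. It assumes throughput stays constant at the reported 28 tokens per second and treats the p50 latency as typical; real requests will vary with provider and load. The `estimate_seconds` helper is a hypothetical name used only for this example.

```python
def estimate_seconds(
    output_tokens: int,
    tokens_per_second: float = 28.0,  # throughput quoted above
    ttft_seconds: float = 0.303,      # p50 time-to-first-token quoted above
) -> float:
    """Estimate wall-clock time to receive a full response of `output_tokens` tokens."""
    return ttft_seconds + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 500, 2_000):
        print(f"{n} output tokens: ~{estimate_seconds(n):.1f} s")
```

Under these assumptions a 500-token reply takes a little over 18 seconds, so the generation rate dominates the total and the time-to-first-token contributes only a fraction of a second.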
Llama 3.1 70B Instruct supports a 131K-token context window and can output up to 16K tokens. It accepts text input.