mistralai/mistral-small-24b-instruct-2501
Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...
| Provider | Input ($/M tokens) | Output ($/M tokens) | Context | Uptime |
|---|---|---|---|---|
| DeepInfra (fp8) | $0.050 | $0.080 | 33K | 100% |
Mistral Small 3 costs $0.050 per million input tokens and $0.080 per million output tokens via OpenRouter, making it the 21st cheapest of 298 paid models.
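
To make the per-token pricing concrete, here is a minimal sketch of the arithmetic for a single request. The helper name `estimate_cost_usd` and the example token counts are hypothetical; only the two prices come from the listing above, and actual billing may differ (for example, by provider or rounding).

```python
# Prices quoted in the listing, in USD per million tokens.
INPUT_PRICE_PER_M = 0.050
OUTPUT_PRICE_PER_M = 0.080


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Example: a 10,000-token prompt that produces a 2,000-token reply.
cost = estimate_cost_usd(10_000, 2_000)
print(f"${cost:.6f}")  # $0.000660
```

At these rates, a thousand such requests would cost roughly $0.66.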
Mistral Small 3 generates around 47 tokens per second with a 270 ms time-to-first-token (p50), making it the 180th fastest tracked model.
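
As a rough illustration of what these figures mean for response time, the sketch below adds the time-to-first-token to the generation time at a constant throughput. This is a simplification: it assumes the median figures hold for every request and that throughput stays flat for the whole reply. The helper name `estimate_latency_seconds` is hypothetical.

```python
# Median figures quoted in the listing.
TTFT_SECONDS = 0.270        # time-to-first-token (p50)
TOKENS_PER_SECOND = 47.0    # approximate generation throughput


def estimate_latency_seconds(output_tokens: int) -> float:
    """Rough end-to-end latency, assuming constant throughput (hypothetical helper)."""
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Example: a 500-token reply.
print(f"{estimate_latency_seconds(500):.1f} s")  # 10.9 s
```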
Mistral Small 3 supports a 33K-token context window and can output up to 16K tokens. It accepts text input.
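
The two limits interact when sizing a request. The sketch below rests on two assumptions that the listing does not state: that input and output tokens share the same context window, and that the rounded figures (33K and 16K) can stand in for the exact limits, which may be slightly different. The helper name `max_output_budget` is hypothetical.

```python
# Rounded limits from the listing; the exact values may differ.
CONTEXT_WINDOW = 33_000   # total tokens (assumed to cover input plus output)
MAX_OUTPUT = 16_000       # cap on generated tokens


def max_output_budget(input_tokens: int) -> int:
    """Largest output length that still fits, under the assumptions above."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - input_tokens
    return max(0, min(MAX_OUTPUT, remaining))


print(max_output_budget(10_000))  # 16000 (limited by the output cap)
print(max_output_budget(25_000))  # 8000 (limited by the remaining context)
print(max_output_budget(40_000))  # 0 (prompt alone exceeds the window)
```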