xiaomi/mimo-v2-flash
MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting hybrid attention architecture. MiMo-V2-Flash supports a...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Xiaomifp8 | $0.100 | $0.300 | 262K | 100% |
MiMo-V2-Flash costs $0.100 per million input tokens and $0.300 per million output tokens via OpenRouter, making it 51st cheapest of 298 paid models.
MiMo-V2-Flash scores 30.3 on the Artificial Analysis Intelligence Index, ranking 77th of 179 benchmarked models, with a GPQA Diamond score of 66%.
MiMo-V2-Flash generates around 92 tokens per second with 614ms time-to-first-token (p50), the 74th fastest tracked model.
MiMo-V2-Flash supports a 262K-token context window and can output up to 66K tokens. It accepts text input.