minimax/minimax-m1
MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...
| Provider | In $/M | Out $/M | Context | Uptime |
|---|---|---|---|---|
| Minimax | $0.400 | $2.20 | 1M | — |
| Novitabf16 | $0.550 | $2.20 | 1M | — |
MiniMax M1 costs $0.400 per million input tokens and $2.20 per million output tokens via OpenRouter, making it 149th cheapest of 298 paid models.
MiniMax M1 generates around 18 tokens per second with 840ms time-to-first-token (p50), the 266th fastest tracked model.
MiniMax M1 supports a 1M-token context window and can output up to 40K tokens. It accepts text input.