modelgrep

MiniMax: MiniMax M2.5 vs StepFun: Step 3.7 Flash

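MiniMax: MiniMax M2.5 leads on 7 of the 9 numeric metrics, while StepFun: Step 3.7 Flash leads on Intelligence Index and context window and is the only one of the two that lists Vision. The right pick depends on what you optimize for; see the breakdown below.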

| Metric | MiniMax: MiniMax M2.5 | StepFun: Step 3.7 Flash |
| --- | --- | --- |
| Intelligence Index | 41.9 | 42.6 |
| Coding Index | 37.4 | 37.1 |
| GPQA Diamond | 85% | 81% |
| Design Arena Elo | 1266 | 1232 |
| Speed (tokens/sec) | 214 | 63 |
| Latency | 423 ms | 2.2 s |
| Input price /M | $0.150 | $0.200 |
| Output price /M | $0.900 | $1.15 |
| Context window | 205K | 256K |
| Capabilities | Reasoning, Tools, JSON | Reasoning, Tools, JSON, Vision |
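
The "7 of 9" tally can be reproduced with a short script. This is a minimal sketch, not the site's actual scoring method. The values are transcribed from the table, with the speed row read as 214 vs 63 tokens/sec and latency converted to milliseconds. The `higher_is_better` flags and the `tally_wins` helper are assumptions introduced for illustration: latency and prices are treated as lower-is-better, everything else as higher-is-better, and the non-numeric Capabilities row is left out.

```python
# Values transcribed from the comparison table above.
# The higher_is_better flags are an assumption, not something the page states.
METRICS = [
    # (name, MiniMax M2.5, Step 3.7 Flash, higher_is_better)
    ("Intelligence Index", 41.9, 42.6, True),
    ("Coding Index", 37.4, 37.1, True),
    ("GPQA Diamond (%)", 85, 81, True),
    ("Design Arena Elo", 1266, 1232, True),
    ("Speed (tokens/sec)", 214, 63, True),
    ("Latency (ms)", 423, 2200, False),
    ("Input price ($/M)", 0.150, 0.200, False),
    ("Output price ($/M)", 0.900, 1.15, False),
    ("Context window (K tokens)", 205, 256, True),
]


def tally_wins(metrics):
    """Count how many metrics each model wins (hypothetical helper)."""
    wins = {"MiniMax M2.5": 0, "Step 3.7 Flash": 0, "tie": 0}
    for _name, minimax, step, higher_is_better in metrics:
        if minimax == step:
            wins["tie"] += 1
        elif (minimax > step) == higher_is_better:
            # MiniMax is ahead in the direction that counts for this metric.
            wins["MiniMax M2.5"] += 1
        else:
            wins["Step 3.7 Flash"] += 1
    return wins


if __name__ == "__main__":
    print(tally_wins(METRICS))
    # {'MiniMax M2.5': 7, 'Step 3.7 Flash': 2, 'tie': 0}
```

An unweighted count like this treats a 0.3-point Coding Index gap the same as a roughly 3x speed gap, so it is best read as a quick summary rather than a ranking.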