modelgrep

DeepSeek: DeepSeek V3.2 vs StepFun: Step 3.7 Flash

StepFun: Step 3.7 Flash wins on more metrics (6 of 9), but the right pick depends on what you optimize for — see the breakdown below.

MetricDeepSeek: DeepSeek V3.2StepFun: Step 3.7 Flash
Intelligence Index41.742.6
Coding Index36.737.1
GPQA Diamond84%81%
Design Arena Elo12211232
Speed (tokens/sec)4182
Latency458ms1.5s
Input price /M$0.229$0.200
Output price /M$0.343$1.15
Context window131K256K
CapabilitiesReasoningToolsJSONReasoningToolsJSONVision