modelgrep

Anthropic: Claude Sonnet 4.5 vs StepFun: Step 3.5 Flash

StepFun: Step 3.5 Flash wins on more metrics (5 of 9), but the right pick depends on what you optimize for — see the breakdown below.

MetricAnthropic: Claude Sonnet 4.5StepFun: Step 3.5 Flash
Intelligence Index37.137.8
Coding Index33.531.6
GPQA Diamond73%83%
Design Arena Elo1242
Speed (tokens/sec)4544
Latency787ms418ms
Input price /M$3.00$0.090
Output price /M$15.00$0.300
Context window1M262K
CapabilitiesReasoningToolsJSONVisionReasoningToolsJSON