modelgrep

Anthropic: Claude Sonnet 4.6 vs StepFun: Step 3.7 Flash

Anthropic: Claude Sonnet 4.6 wins on more metrics (5 of 8), but the right pick depends on what you optimize for — see the breakdown below.

MetricAnthropic: Claude Sonnet 4.6StepFun: Step 3.7 Flash
Intelligence Index42.642.6
Coding Index43.037.1
GPQA Diamond80%81%
Design Arena Elo13281232
Speed (tokens/sec)46
Latency1.1s
Input price /M$3.00$0.200
Output price /M$15.00$1.15
Context window1M256K
CapabilitiesReasoningToolsJSONVisionReasoningToolsJSONVision