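Qwen: Qwen3.5-27B wins 4 of the 7 metrics that have a clear winner (Coding Index, GPQA Diamond, speed, and latency), while StepFun: Step 3.5 Flash leads on Intelligence Index and on both input and output price. The right pick depends on what you optimize for; see the breakdown below.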
| Metric | Qwen: Qwen3.5-27B | StepFun: Step 3.5 Flash |
|---|---|---|
| Intelligence Index | 37.2 | 37.8✓ |
| Coding Index | 33.4✓ | 31.6 |
| GPQA Diamond | 84%✓ | 83% |
| Design Arena Elo | — | — |
| Speed (tokens/sec) | 50✓ | 47 |
| Latency | 204ms✓ | 403ms |
| Input price (per 1M tokens) | $0.195 | $0.090✓ |
| Output price (per 1M tokens) | $1.56 | $0.300✓ |
| Context window | 262K | 262K |
| Capabilities | Reasoning, Tools, JSON, Vision | Reasoning, Tools, JSON |
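
The win tally and the price gap in the table can be checked with a few lines of Python. This is a minimal sketch, not part of the original comparison: the numbers are copied from the table, while the `higher_is_better` flags, the function names `tally_wins` and `workload_cost`, and the example workload of 10M input and 2M output tokens are assumptions made for illustration.

```python
# Values copied from the table; each entry is (Qwen3.5-27B, Step 3.5 Flash, higher_is_better).
# The higher_is_better flags are an assumption about how each metric is scored.
METRICS = {
    "Intelligence Index": (37.2, 37.8, True),
    "Coding Index": (33.4, 31.6, True),
    "GPQA Diamond (%)": (84, 83, True),
    "Speed (tokens/sec)": (50, 47, True),
    "Latency (ms)": (204, 403, False),
    "Input price ($/M)": (0.195, 0.090, False),
    "Output price ($/M)": (1.56, 0.300, False),
}


def tally_wins(metrics):
    """Return (qwen_wins, step_wins); metrics with equal values are skipped."""
    qwen_wins = step_wins = 0
    for qwen_value, step_value, higher_is_better in metrics.values():
        if qwen_value == step_value:
            continue
        if higher_is_better:
            qwen_ahead = qwen_value > step_value
        else:
            qwen_ahead = qwen_value < step_value
        if qwen_ahead:
            qwen_wins += 1
        else:
            step_wins += 1
    return qwen_wins, step_wins


def workload_cost(input_tokens, output_tokens, input_price, output_price):
    """Cost in dollars, given prices quoted per one million tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    print("wins (Qwen, Step):", tally_wins(METRICS))
    # Hypothetical workload: 10M input tokens and 2M output tokens.
    print("Qwen cost:", round(workload_cost(10_000_000, 2_000_000, 0.195, 1.56), 2))
    print("Step cost:", round(workload_cost(10_000_000, 2_000_000, 0.090, 0.300), 2))
```

Context window and Design Arena Elo are left out of the tally because the table shows no winner for either. Under the assumed workload the output price dominates the bill, so a team optimizing for cost may weigh the two price rows more heavily than a simple count of wins suggests.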