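Qwen: Qwen3.5-9B wins on more metrics (5 of 9: GPQA Diamond, latency, input price, output price, and context window), while OpenAI: o4 Mini leads on the other 4 (Intelligence Index, Coding Index, speed, and Design Arena Elo, where Qwen3.5-9B has no score). The right pick depends on what you optimize for; see the breakdown below.
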
| Metric | OpenAI: o4 Mini | Qwen: Qwen3.5-9B |
|---|---|---|
| Intelligence Index | 33.1✓ | 32.4 |
| Coding Index | 25.6✓ | 25.3 |
| GPQA Diamond | 78% | 81%✓ |
| Design Arena Elo | 1072✓ | — |
| Speed (tokens/sec) | 125✓ | 68 |
| Latency | 4.7s | 362ms✓ |
| Input price (per 1M tokens) | $1.10 | $0.10✓ |
| Output price (per 1M tokens) | $4.40 | $0.15✓ |
| Context window | 200K | 262K✓ |
| Capabilities | Reasoning, Tools, JSON, Vision | Reasoning, Tools, JSON, Vision |
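
To make "depends on what you optimize for" concrete, the price and speed rows above can be turned into a rough per-request estimate. The following is a minimal sketch, not an official calculator. It assumes that "/M" means per one million tokens, that the latency row is time to first token, and that the speed row is output generation rate. The `ModelSpec` class, the helper functions, and the 2,000-input / 500-output token workload are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    name: str
    input_price_per_m: float   # USD per 1M input tokens (assumed meaning of "/M")
    output_price_per_m: float  # USD per 1M output tokens (assumed meaning of "/M")
    latency_s: float           # assumed: time to first token, in seconds
    tokens_per_sec: float      # assumed: output generation speed


# Figures copied from the table above.
O4_MINI = ModelSpec("OpenAI: o4 Mini", 1.10, 4.40, 4.7, 125.0)
QWEN = ModelSpec("Qwen: Qwen3.5-9B", 0.10, 0.15, 0.362, 68.0)


def request_cost(model: ModelSpec, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request."""
    total = (input_tokens * model.input_price_per_m
             + output_tokens * model.output_price_per_m)
    return total / 1_000_000


def response_time(model: ModelSpec, output_tokens: int) -> float:
    """Estimated seconds until the full response has been generated."""
    return model.latency_s + output_tokens / model.tokens_per_sec


if __name__ == "__main__":
    # Hypothetical workload: 2,000 input tokens, 500 output tokens.
    for model in (O4_MINI, QWEN):
        cost = request_cost(model, 2_000, 500)
        seconds = response_time(model, 500)
        print(f"{model.name}: ${cost:.6f} per request, ~{seconds:.1f}s total")
```

Under these assumptions the hypothetical request costs about $0.0044 on o4 Mini and about $0.000275 on Qwen3.5-9B, roughly a 16x difference. Qwen3.5-9B also finishes a 500-token response sooner because of its lower latency, but o4 Mini's higher throughput overtakes it once responses grow past roughly 650 output tokens.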