Qwen: Qwen3 Max Thinking wins on more metrics (4 of 7), but the right pick depends on what you optimize for — see the breakdown below.
| Metric | Mistral: Mistral Medium 3.5 | Qwen: Qwen3 Max Thinking |
|---|---|---|
| Intelligence Index | 39.2 | 39.8✓ |
| Coding Index | 35.4✓ | 30.5 |
| GPQA Diamond | 75% | 86%✓ |
| Design Arena Elo | — | — |
| Speed (tokens/sec) | 47✓ | 38 |
| Latency | 691ms✓ | 1.3s |
| Input price /M | $1.50 | $0.780✓ |
| Output price /M | $7.50 | $3.90✓ |
| Context window | 262K | 262K |
| Capabilities | ReasoningToolsJSONVision | ReasoningToolsJSON |