modelgrep

Mistral: Devstral 2 2512 vs Qwen: Qwen3 30B A3B Thinking 2507

Qwen: Qwen3 30B A3B Thinking 2507 wins on more metrics (6 of 8), but the right pick depends on what you optimize for — see the breakdown below.

MetricMistral: Devstral 2 2512Qwen: Qwen3 30B A3B Thinking 2507
Intelligence Index
Coding Index23.714.6
GPQA Diamond59%71%
Design Arena Elo973
Speed (tokens/sec)10161
Latency897ms452ms
Input price /M$0.400$0.080
Output price /M$2.00$0.400
Context window262K131K
CapabilitiesToolsJSONReasoningToolsJSON