modelgrep

Microsoft: Phi 4 vs Qwen: Qwen3 8B

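Microsoft: Phi 4 wins on more metrics (5 of 8): Coding Index, GPQA Diamond, speed, latency, and output price. Qwen: Qwen3 8B leads on Intelligence Index, input price, and context window, so the right pick depends on what you optimize for; see the breakdown below.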

Metric               Microsoft: Phi 4   Qwen: Qwen3 8B
Intelligence Index   10.4               10.6
Coding Index         11.2               7.1
GPQA Diamond         57%                45%
Design Arena Elo     n/a                n/a
Speed (tokens/sec)   65                 30
Latency              214ms              656ms
Input price /M       $0.065             $0.050
Output price /M      $0.140             $0.400
Context window       16K                131K
Capabilities         JSON               Reasoning, Tools, JSON
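
The "5 of 8" tally and the price trade-off can be checked with a short script. This is a minimal sketch, not how modelgrep computes its summary: the names `METRICS`, `tally_wins`, and `request_cost` are hypothetical, the higher-is-better flags are assumptions, and the values assume the fused table cells split as 65 and 30 tokens/sec, and so on.

```python
# Values copied from the comparison table. How the fused cells split
# (for example speed "6530" read as 65 and 30) is an assumption.
METRICS = {
    # name: (phi_4, qwen3_8b, higher_is_better)
    "Intelligence Index": (10.4, 10.6, True),
    "Coding Index": (11.2, 7.1, True),
    "GPQA Diamond (%)": (57, 45, True),
    "Speed (tokens/sec)": (65, 30, True),
    "Latency (ms)": (214, 656, False),
    "Input price ($/M)": (0.065, 0.050, False),
    "Output price ($/M)": (0.140, 0.400, False),
    "Context window (K tokens)": (16, 131, True),
}


def tally_wins(metrics):
    """Count how many metrics each model wins; ties count for neither."""
    wins = {"phi_4": 0, "qwen3_8b": 0}
    for phi, qwen, higher_is_better in metrics.values():
        if phi == qwen:
            continue
        # Phi 4 wins if it is larger on a higher-is-better metric,
        # or smaller on a lower-is-better metric.
        phi_wins = (phi > qwen) == higher_is_better
        wins["phi_4" if phi_wins else "qwen3_8b"] += 1
    return wins


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Dollar cost of one request, with prices quoted per million tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    print(tally_wins(METRICS))
    # Hypothetical workload: 2,000 input tokens and 500 output tokens.
    print("Phi 4:   ", request_cost(2000, 500, 0.065, 0.140))
    print("Qwen3 8B:", request_cost(2000, 500, 0.050, 0.400))
```

At the listed prices, Phi 4 is cheaper per request unless input tokens outnumber output tokens by more than about 17 to 1 (0.26 / 0.015), so Qwen3 8B's lower input price only pays off on very input-heavy workloads.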