The best xAI model for reasoning is Grok 4.3 — 53.2 intelligence with extended step-by-step thinking. Grok 4.20 (29.7) and Grok Build 0.1 (—) round out the top three.
Large language models with extended reasoning / thinking, ranked by intelligence. The best models for complex multi-step reasoning, math and analysis.
The best xAI model for reasoning is Grok 4.3 — 53.2 intelligence with extended step-by-step thinking. Grok 4.20 (29.7) and Grok Build 0.1 (—) round out the top three.
Grok 4.20 (29.7) is the closest alternative on this metric, followed by Grok Build 0.1 (—). See the full ranking above for the tradeoffs.
modelgrep tracks 4 xAI models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Grok 4.3. 4 of them qualify for this ranking.