modelgrep

Best xAI Models for Coding

Quick answer · Updated June 2026

Grok 4.3 is the best xAI model for coding, with a 41.0 Artificial Analysis Coding Index across benchmarks like SWE-bench and SciCode. Grok 4.20 (25.4) is next.

41.0Coding
53.2Intelligence
155 t/sSpeed
$1.25Input /M
1MContext

AI models ranked by the Artificial Analysis Coding Index, measuring real-world software engineering ability across benchmarks like SWE-bench, SciCode and terminal tasks. The best LLMs for code generation, debugging and agentic development.

  1. 1X
    grok-4.3
    ReasoningToolsJSON+153.2 intel · $1.25/M · 155 t/s
    41.0
    Coding
  2. 2X
    grok-4.20
    ReasoningToolsJSON+129.7 intel · $1.25/M · 76 t/s
    25.4
    Coding

Frequently asked

What is the best xAI model for coding?

Grok 4.3 is the best xAI model for coding, with a 41.0 Artificial Analysis Coding Index across benchmarks like SWE-bench and SciCode. Grok 4.20 (25.4) is next.

What's a good alternative to Grok 4.3?

Grok 4.20 (25.4) is the closest alternative on this metric. See the full ranking above for the tradeoffs.

How many xAI models are there?

modelgrep tracks 4 xAI models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Grok 4.3. 2 of them qualify for this ranking.

More xAI rankings

All rankings