Llama 4 Maverick is the best Meta model for coding, with a 15.6 Artificial Analysis Coding Index across benchmarks like SWE-bench and SciCode. Llama 3.1 70B Instruct (10.9) and Llama 3.3 70B Instruct (free) (10.7) round out the top three.
AI models ranked by the Artificial Analysis Coding Index, measuring real-world software engineering ability across benchmarks like SWE-bench, SciCode and terminal tasks. The best LLMs for code generation, debugging and agentic development.
Llama 4 Maverick is the best Meta model for coding, with a 15.6 Artificial Analysis Coding Index across benchmarks like SWE-bench and SciCode. Llama 3.1 70B Instruct (10.9) and Llama 3.3 70B Instruct (free) (10.7) round out the top three.
Llama 3.1 70B Instruct (10.9) is the closest alternative on this metric, followed by Llama 3.3 70B Instruct (free) (10.7). See the full ranking above for the tradeoffs.
modelgrep tracks 13 Meta models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Llama 4 Maverick. 10 of them qualify for this ranking.