Granite 4.1 8B is the best IBM model for coding, scoring 7.3 on the Artificial Analysis Coding Index, which spans benchmarks like SWE-bench and SciCode. Granite 4.0 Micro is next at 5.0.
AI models are ranked here by the Artificial Analysis Coding Index, which measures real-world software engineering ability across benchmarks like SWE-bench, SciCode and terminal tasks. The ranking highlights the best LLMs for code generation, debugging and agentic development.
Granite 4.0 Micro (5.0) is the closest alternative on this metric. See the full ranking above for the tradeoffs.
modelgrep tracks two IBM models with live benchmarks, speed, latency and per-provider pricing. Granite 4.1 8B leads on intelligence, and both models qualify for this ranking.