Gemini 3.5 Flash is the best Google model for coding, with a 47.1 Artificial Analysis Coding Index across benchmarks like SWE-bench and SciCode. Gemini 3 Flash Preview (42.6) and Gemini 3.1 Pro Preview (39.4) round out the top three.
AI models ranked by the Artificial Analysis Coding Index, measuring real-world software engineering ability across benchmarks like SWE-bench, SciCode and terminal tasks. The best LLMs for code generation, debugging and agentic development.
Gemini 3.5 Flash is the best Google model for coding, with a 47.1 Artificial Analysis Coding Index across benchmarks like SWE-bench and SciCode. Gemini 3 Flash Preview (42.6) and Gemini 3.1 Pro Preview (39.4) round out the top three.
Gemini 3 Flash Preview (42.6) is the closest alternative on this metric, followed by Gemini 3.1 Pro Preview (39.4). See the full ranking above for the tradeoffs.
modelgrep tracks 26 Google models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Gemini 3 Flash Preview. 14 of them qualify for this ranking.