modelgrep

Best Mistral Models for Coding

Quick answer · Updated June 2026

Mistral Medium 3.5 is the best Mistral model for coding, scoring 35.4 on the Artificial Analysis Coding Index, which spans benchmarks such as SWE-bench and SciCode. Devstral 2 2512 (23.7) and Mistral Large 3 2512 (22.7) round out the top three.

Coding: 35.4
Intelligence: 39.2
Speed: 53 t/s
Input price: $1.50/M
Context: 262K

Mistral models ranked by the Artificial Analysis Coding Index, which measures real-world software engineering ability across benchmarks like SWE-bench, SciCode and terminal tasks. Use the ranking to find the best LLMs for code generation, debugging and agentic development.

  1. mistral-medium-3-5 · 35.4 Coding · Reasoning, Tools, JSON, +1 · 39.2 intel · $1.50/M · 53 t/s
  2. devstral-2512 · 23.7 Coding · Tools, JSON · 22.0 intel · $0.400/M · 43 t/s
  3. mistral-large-2512 · 22.7 Coding · Tools, JSON, Vision · 22.8 intel · $0.500/M · 43 t/s
  4. mistral-medium-3.1 · 18.3 Coding · Tools, JSON, Vision · 21.3 intel · $0.400/M · 131K ctx
  5. mistral-small-2603 · 16.4 Coding · Reasoning, Tools, JSON, +1 · 18.6 intel · $0.150/M · 117 t/s
  6. mistral-medium-3 · 13.6 Coding · Tools, JSON, Vision · 18.8 intel · $0.400/M · 34 t/s
  7. ministral-14b-2512 · 10.9 Coding · Tools, JSON, Vision · 16.0 intel · $0.200/M · 262K ctx
  8. ministral-8b-2512 · 10.0 Coding · Tools, JSON, Vision · 14.8 intel · $0.150/M · 11 t/s
  9. ministral-3b-2512 · 4.8 Coding · Tools, JSON, Vision · 11.2 intel · $0.100/M · 54 t/s

Frequently asked

What's a good alternative to Mistral Medium 3.5?

Devstral 2 2512 (23.7) is the closest alternative on this metric, followed by Mistral Large 3 2512 (22.7). See the full ranking above for the tradeoffs.
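One rough way to read those tradeoffs is to set each model's Coding Index against its input price. The sketch below uses only the scores and input prices listed in the ranking; the "points per dollar" ratio and the helper names `rank_by_value` and `best_under_budget` are illustrative assumptions, not metrics or functions published by modelgrep or Artificial Analysis, and output-token pricing, speed and context length are ignored.

```python
# Coding Index and input price ($ per million tokens), copied from the ranking.
MODELS = {
    "mistral-medium-3-5": (35.4, 1.50),
    "devstral-2512": (23.7, 0.400),
    "mistral-large-2512": (22.7, 0.500),
    "mistral-medium-3.1": (18.3, 0.400),
    "mistral-small-2603": (16.4, 0.150),
    "mistral-medium-3": (13.6, 0.400),
    "ministral-14b-2512": (10.9, 0.200),
    "ministral-8b-2512": (10.0, 0.150),
    "ministral-3b-2512": (4.8, 0.100),
}


def points_per_dollar(coding, price):
    """Hypothetical value metric: Coding Index points per input dollar."""
    return coding / price


def rank_by_value(models):
    """Return model names sorted from best to worst points per dollar."""
    return sorted(
        models,
        key=lambda name: points_per_dollar(*models[name]),
        reverse=True,
    )


def best_under_budget(models, max_price):
    """Return the highest-scoring model at or below max_price, or None."""
    affordable = {n: v for n, v in models.items() if v[1] <= max_price}
    if not affordable:
        return None
    return max(affordable, key=lambda n: affordable[n][0])


if __name__ == "__main__":
    for name in rank_by_value(MODELS):
        coding, price = MODELS[name]
        print(f"{name}: {points_per_dollar(coding, price):.1f} points per $")
    print("Best at or under $0.40/M:", best_under_budget(MODELS, 0.40))
```

On this crude ratio the cheaper models come out ahead of Mistral Medium 3.5, which is a reminder that the ranking orders models by raw coding score only.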

How many Mistral models are there?

modelgrep tracks 19 Mistral models with live benchmarks, speed, latency and per-provider pricing; Mistral Medium 3.5 leads them on intelligence. Nine of the 19 qualify for this ranking.
