modelgrep

Best Vision LLMs

Quick answer · Updated June 2026

Claude Fable 5 tops this ranking of vision-capable LLMs, pairing an intelligence score of 64.9 with image and document understanding. Claude Opus 4.8 (61.4) and Claude Opus 4.7 (57.3) round out the top three.

64.9 Intelligence
$10.00 Input /M
1M Context

Multimodal large language models that accept image input, ranked by intelligence score. These are the strongest vision-capable AI models for understanding images, documents, and charts.

  1. claude-fable-5
    Reasoning, Tools, JSON, +1 · 64.9 intel · $10.00/M · 1M ctx
  2. claude-opus-4.8
    Reasoning, Tools, JSON, +1 · 61.4 intel · $5.00/M · 58 t/s
  3. claude-opus-4.7
    Reasoning, Tools, JSON, +1 · 57.3 intel · $5.00/M · 60 t/s
  4. gpt-5.5
    Reasoning, Tools, JSON, +1 · 56.7 intel · $5.00/M · 37 t/s
  5. minimax-m3
    Reasoning, Tools, JSON, +1 · 54.7 intel · $0.300/M · 47 t/s
  6. gpt-5.3-codex
    Reasoning, Tools, JSON, +1 · 53.6 intel · $1.75/M · 45 t/s
  7. qwen3.7-plus
    Reasoning, Tools, JSON, +1 · 53.3 intel · $0.320/M · 29 t/s
  8. grok-4.3
    Reasoning, Tools, JSON, +1 · 53.2 intel · $1.25/M · 135 t/s
  9. claude-opus-4.6
    Reasoning, Tools, JSON, +1 · 52.9 intel · $5.00/M · 46 t/s
  10. qwen3.6-plus
    Reasoning, Tools, JSON, +1 · 50.0 intel · $0.325/M · 36 t/s
  11. mimo-v2.5
    Reasoning, Tools, JSON, +2 · 49.0 intel · $0.140/M · 49 t/s
  12. gpt-5.2-codex
    Reasoning, Tools, JSON, +1 · 49.0 intel · $1.75/M · 85 t/s
  13. gpt-5.1
    Reasoning, Tools, JSON, +1 · 47.7 intel · $1.25/M · 55 t/s
  14. gemini-3-flash-preview
    Reasoning, Tools, JSON, +2 · 46.4 intel · $0.500/M · 66 t/s
  15. gpt-5-codex
    Reasoning, Tools, JSON, +1 · 44.6 intel · $1.25/M · 43 t/s
  16. gpt-5.4-nano
    Reasoning, Tools, JSON, +1 · 44.0 intel · $0.200/M · 56 t/s
  17. gemini-3.5-flash
    Reasoning, Tools, JSON, +2 · 43.3 intel · $1.50/M · 164 t/s
  18. claude-opus-4.5
    Reasoning, Tools, JSON, +1 · 43.1 intel · $5.00/M · 51 t/s
  19. gpt-5.1-codex
    Reasoning, Tools, JSON, +1 · 43.1 intel · $1.25/M · 55 t/s
  20. kimi-k2.6
    Reasoning, Tools, JSON, +1 · 42.9 intel · $0.680/M · 116 t/s
  21. step-3.7-flash
    Reasoning, Tools, JSON, +1 · 42.6 intel · $0.200/M · 76 t/s
  22. claude-sonnet-4.6
    Reasoning, Tools, JSON, +1 · 42.6 intel · $3.00/M · 48 t/s
  23. gpt-5
    Reasoning, Tools, JSON, +1 · 42.0 intel · $1.25/M · 58 t/s
  24. gemini-3.1-pro-preview
    Reasoning, Tools, JSON, +2 · 41.3 intel · $2.00/M · 100 t/s
  25. qwen3.5-397b-a17b
    Reasoning, Tools, JSON, +1 · 40.1 intel · $0.390/M · 134 t/s

Frequently asked

What is the best LLM for vision?

Claude Fable 5 tops this ranking of vision-capable LLMs, pairing an intelligence score of 64.9 with image and document understanding. Claude Opus 4.8 (61.4) and Claude Opus 4.7 (57.3) round out the top three.

What's a good alternative to Claude Fable 5?

Claude Opus 4.8 (61.4) is the closest alternative on this metric, followed by Claude Opus 4.7 (57.3). See the full ranking above for the tradeoffs.
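
One way to weigh those tradeoffs is to cap the input price and see which models remain, highest intelligence first. The sketch below is illustrative only: it copies a handful of rows from the ranking above, assumes "/M" means US dollars per million input tokens, and uses a hypothetical helper named `best_under` that is not part of any modelgrep tooling.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    intelligence: float  # intelligence score as listed in the ranking
    input_price: float   # listed "$/M" figure; assumed USD per million input tokens


# A few rows copied from the ranking above (not the full list).
MODELS = [
    Model("claude-fable-5", 64.9, 10.00),
    Model("claude-opus-4.8", 61.4, 5.00),
    Model("claude-opus-4.7", 57.3, 5.00),
    Model("gpt-5.5", 56.7, 5.00),
    Model("minimax-m3", 54.7, 0.30),
    Model("qwen3.7-plus", 53.3, 0.32),
    Model("grok-4.3", 53.2, 1.25),
]


def best_under(models, max_input_price):
    """Return models at or below the price cap, highest intelligence first."""
    affordable = [m for m in models if m.input_price <= max_input_price]
    return sorted(affordable, key=lambda m: m.intelligence, reverse=True)


if __name__ == "__main__":
    # Example: which of the sampled models cost at most $2.00/M input?
    for m in best_under(MODELS, max_input_price=2.00):
        print(f"{m.name}: {m.intelligence} intel at ${m.input_price:.2f}/M")
```

With a $2.00 cap, the sampled rows leave minimax-m3, qwen3.7-plus, and grok-4.3, in that order. The same filter could be extended to throughput or context length if those matter more than price.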
