Best Vision LLMs

Match · Updated July 2026

Claude Opus 5 is the best vision-capable LLM, pairing 60.7 intelligence with image and document understanding. Claude Fable 5 (59.9) and Claude Fable 5 (batch) (59.9) round out the top three.

60.7Intelligence

$5.00Input /M

1MContext

Multimodal large language models that accept image input, ranked by intelligence. The best vision language models (VLMs) for understanding images, documents and charts.

1
claude-opus-5
ReasoningToolsJSON+160.7 intel · $5.00/M · 1M ctx
60.7
Intelligence
2
claude-fable-5
ReasoningToolsJSON+159.9 intel · $10.00/M · 1M ctx
59.9
Intelligence
3
claude-fable-5:batch
ReasoningToolsJSON+159.9 intel · $5.00/M · 1M ctx
59.9
Intelligence
4
gpt-5.6-sol
ReasoningToolsJSON+158.9 intel · $5.00/M · 1.1M ctx
58.9
Intelligence
5
kimi-k3
ReasoningToolsJSON+157.1 intel · $3.00/M · 1.0M ctx
57.1
Intelligence
6
claude-opus-4.8
ReasoningToolsJSON+155.7 intel · $5.00/M · 1M ctx
55.7
Intelligence
7
claude-opus-4.8:batch
ReasoningToolsJSON+155.7 intel · $2.50/M · 1M ctx
55.7
Intelligence
8
gpt-5.6-terra
ReasoningToolsJSON+155.0 intel · $1.00/M · 1.1M ctx
55.0
Intelligence
9
gpt-5.5
ReasoningToolsJSON+154.8 intel · $5.00/M · 1.1M ctx
54.8
Intelligence
10
gpt-5.5:batch
ReasoningToolsJSON+154.8 intel · $2.50/M · 1.1M ctx
54.8
Intelligence
11
grok-4.5
ReasoningToolsJSON+153.8 intel · $2.00/M · 500K ctx
53.8
Intelligence
12
claude-opus-4.7
ReasoningToolsJSON+153.5 intel · $5.00/M · 1M ctx
53.5
Intelligence
13
claude-opus-4.7:batch
ReasoningToolsJSON+153.5 intel · $2.50/M · 1M ctx
53.5
Intelligence
14
claude-sonnet-5
ReasoningToolsJSON+153.4 intel · $2.00/M · 1M ctx
53.4
Intelligence
15
claude-sonnet-5:batch
ReasoningToolsJSON+153.4 intel · $1.00/M · 1M ctx
53.4
Intelligence
16
gpt-5.4
ReasoningToolsJSON+151.4 intel · $2.50/M · 1.1M ctx
51.4
Intelligence
17
gpt-5.4:batch
ReasoningToolsJSON+151.4 intel · $1.25/M · 1.1M ctx
51.4
Intelligence
18
gpt-5.6-luna
ReasoningToolsJSON+151.2 intel · $0.100/M · 1.1M ctx
51.2
Intelligence
19M
muse-spark-1.1
ReasoningToolsJSON+250.6 intel · $1.25/M · 1.0M ctx
50.6
Intelligence
20
gemini-3.5-flash
ReasoningToolsJSON+250.2 intel · $1.50/M · 1.0M ctx
50.2
Intelligence
21
gemini-3.5-flash:batch
ReasoningToolsJSON+250.2 intel · $0.750/M · 1.0M ctx
50.2
Intelligence
22
gemini-3.6-flash
ReasoningToolsJSON+250.1 intel · $1.50/M · 1.0M ctx
50.1
Intelligence
23
gemini-3.6-flash:batch
ReasoningToolsJSON+250.1 intel · $0.750/M · 1.0M ctx
50.1
Intelligence
24
gemini-3.1-pro-preview
ReasoningToolsJSON+246.5 intel · $2.00/M · 1.0M ctx
46.5
Intelligence
25
gemini-3.1-pro-preview:batch
ReasoningToolsJSON+246.5 intel · $1.00/M · 1.0M ctx
46.5
Intelligence

Frequently asked

What is the best LLM for vision?

Claude Opus 5 is the best vision-capable LLM, pairing 60.7 intelligence with image and document understanding. Claude Fable 5 (59.9) and Claude Fable 5 (batch) (59.9) round out the top three.

What is the best multimodal AI model?

Claude Opus 5 is the best vision-capable AI model, pairing 60.7 intelligence with image and document understanding. Claude Fable 5 (59.9) and Claude Fable 5 (batch) (59.9) round out the top three.

What's a good alternative to Claude Opus 5?

Claude Fable 5 (59.9) is the closest alternative on this metric, followed by Claude Fable 5 (batch) (59.9). See the full ranking above for the tradeoffs.

By maker

OpenAI Qwen Google Anthropic Mistral Z.ai DeepSeek NVIDIA

All rankings

Small & Fast LLMs Best Local LLMs Smartest LLMs Best LLMs for Coding Best LLMs for Design & Frontend Fastest LLMs Lowest-Latency LLMs Cheapest LLMs Best Free LLMs Best Reasoning LLMs Best LLMs for Agents Best Open-Source LLMs Longest-Context LLMs Best LLMs for Writing Best LLMs for Math & Science Best LLMs for RAG Best LLMs for SQL & Data Analysis Best LLMs for Roleplay Best Uncensored LLMs