modelgrep

Best OpenAI Vision Models

Quick answer · Updated June 2026

GPT-5.4 is the best vision-capable OpenAI model, pairing 56.8 intelligence with image and document understanding. GPT-5.5 (56.7) and GPT-5.3-Codex (53.6) round out the top three.

56.8Intelligence
59 t/sSpeed
$2.50Input /M
1.1MContext

Multimodal large language models that accept image input, ranked by intelligence. The best vision-capable AI models for understanding images, documents and charts.

  1. 1O
    gpt-5.4
    ReasoningToolsJSON+156.8 intel · $2.50/M · 59 t/s
    56.8
    Intelligence
  2. 2O
    gpt-5.5
    ReasoningToolsJSON+156.7 intel · $5.00/M · 37 t/s
    56.7
    Intelligence
  3. 3O
    gpt-5.3-codex
    ReasoningToolsJSON+153.6 intel · $1.75/M · 46 t/s
    53.6
    Intelligence
  4. 4O
    gpt-5.2-codex
    ReasoningToolsJSON+149.0 intel · $1.75/M · 96 t/s
    49.0
    Intelligence
  5. 5O
    gpt-5.1
    ReasoningToolsJSON+147.7 intel · $1.25/M · 61 t/s
    47.7
    Intelligence
  6. 6O
    gpt-5.2
    ReasoningToolsJSON+146.6 intel · $1.75/M · 45 t/s
    46.6
    Intelligence
  7. 7O
    gpt-5-codex
    ReasoningToolsJSON+144.6 intel · $1.25/M · 55 t/s
    44.6
    Intelligence
  8. 8O
    gpt-5.4-nano
    ReasoningToolsJSON+144.0 intel · $0.200/M · 57 t/s
    44.0
    Intelligence
  9. 9O
    gpt-5.1-codex
    ReasoningToolsJSON+143.1 intel · $1.25/M · 60 t/s
    43.1
    Intelligence
  10. 10O
    gpt-5
    ReasoningToolsJSON+142.0 intel · $1.25/M · 55 t/s
    42.0
    Intelligence
  11. 11O
    gpt-5-mini
    ReasoningToolsJSON+141.2 intel · $0.250/M · 77 t/s
    41.2
    Intelligence
  12. 12O
    gpt-5.1-codex-mini
    ReasoningToolsJSON+138.6 intel · $0.250/M · 130 t/s
    38.6
    Intelligence
  13. 13O
    o3
    ReasoningToolsJSON+138.4 intel · $2.00/M · 63 t/s
    38.4
    Intelligence
  14. 14O
    o4-mini
    ReasoningToolsJSON+133.1 intel · $1.10/M · 112 t/s
    33.1
    Intelligence
  15. 15O
    gpt-4.1
    ToolsJSONVision26.3 intel · $2.00/M · 45 t/s
    26.3
    Intelligence
  16. 16O
    gpt-5-nano
    ReasoningToolsJSON+125.9 intel · $0.050/M · 106 t/s
    25.9
    Intelligence
  17. 17O
    gpt-5.4-mini
    ReasoningToolsJSON+123.3 intel · $0.750/M · 75 t/s
    23.3
    Intelligence
  18. 18O
    gpt-4.1-mini
    ToolsJSONVision22.9 intel · $0.400/M · 56 t/s
    22.9
    Intelligence
  19. 19O
    gpt-4o-2024-08-06
    ToolsJSONVision18.6 intel · $2.50/M · 23 t/s
    18.6
    Intelligence
  20. 20O
    gpt-4o-2024-11-20
    ToolsJSONVision17.3 intel · $2.50/M · 36 t/s
    17.3
    Intelligence
  21. 21O
    gpt-4.1-nano
    ToolsJSONVision13.0 intel · $0.100/M · 92 t/s
    13.0
    Intelligence
  22. 22O
    gpt-chat-latest
    ToolsJSONVision$5.00/M · 62 t/s · 1.4s ttft
    Intelligence
  23. 23O
    gpt-5.5-pro
    ReasoningToolsJSON+1$30.00/M · 34 t/s · 39.0s ttft
    Intelligence
  24. 24O
    gpt-5.4-image-2
    ReasoningJSONVision+1$8.00/M · 34 t/s · 610ms ttft
    Intelligence
  25. 25O
    gpt-5.4-pro
    ReasoningToolsJSON+1$30.00/M · 6 t/s · 28.2s ttft
    Intelligence

Frequently asked

What is the best OpenAI model for vision?

GPT-5.4 is the best vision-capable OpenAI model, pairing 56.8 intelligence with image and document understanding. GPT-5.5 (56.7) and GPT-5.3-Codex (53.6) round out the top three.

What's a good alternative to GPT-5.4?

GPT-5.5 (56.7) is the closest alternative on this metric, followed by GPT-5.3-Codex (53.6). See the full ranking above for the tradeoffs.

How many OpenAI models are there?

modelgrep tracks 62 OpenAI models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by GPT-5.4. 25 of them qualify for this ranking.

More OpenAI rankings

All rankings