modelgrep

Best Qwen Vision Models

Quick answer · Updated June 2026

Qwen3.7 Plus is the best vision-capable Qwen model, pairing an intelligence score of 53.3 with image and document understanding. Qwen3.6 Plus (50.0) and Qwen3.5-122B-A10B (41.6) round out the top three.

Intelligence: 53.3
Speed: 27 t/s
Input: $0.320/M
Context: 1M

Qwen multimodal large language models that accept image input, ranked by intelligence score. These are the vision-capable models for understanding images, documents and charts.

  1. qwen3.7-plus
    Reasoning, Tools, JSON, +1 · 53.3 intel · $0.320/M · 27 t/s
  2. qwen3.6-plus
    Reasoning, Tools, JSON, +1 · 50.0 intel · $0.325/M · 36 t/s
  3. qwen3.5-122b-a10b
    Reasoning, Tools, JSON, +1 · 41.6 intel · $0.260/M · 88 t/s
  4. qwen3.5-397b-a17b
    Reasoning, Tools, JSON, +1 · 40.1 intel · $0.390/M · 159 t/s
  5. qwen3.5-27b
    Reasoning, Tools, JSON, +1 · 37.2 intel · $0.195/M · 65 t/s
  6. qwen3.6-27b
    Reasoning, Tools, JSON, +1 · 37.1 intel · $0.288/M · 74 t/s
  7. qwen3.5-9b
    Reasoning, Tools, JSON, +1 · 32.4 intel · $0.100/M · 82 t/s
  8. qwen3.6-35b-a3b
    Reasoning, Tools, JSON, +1 · 31.5 intel · $0.150/M · 177 t/s
  9. qwen3.5-35b-a3b
    Reasoning, Tools, JSON, +1 · 30.7 intel · $0.140/M · 157 t/s
  10. qwen3-vl-235b-a22b-thinking
    Reasoning, Tools, JSON, +1 · 27.6 intel · $0.260/M · 43 t/s
  11. qwen3-vl-32b-instruct
    Tools, JSON, Vision · 24.7 intel · $0.104/M · 51 t/s
  12. qwen3-vl-30b-a3b-thinking
    Reasoning, Tools, JSON, +1 · 19.7 intel · $0.130/M · 69 t/s
  13. qwen3-vl-235b-a22b-instruct
    Tools, JSON, Vision · 17.0 intel · $0.200/M · 35 t/s
  14. qwen3-vl-8b-thinking
    Reasoning, Tools, JSON, +1 · 16.7 intel · $0.117/M · 120 t/s
  15. qwen3-vl-30b-a3b-instruct
    Tools, JSON, Vision · 16.0 intel · $0.130/M · 48 t/s
  16. qwen3-vl-8b-instruct
    Tools, JSON, Vision · 14.3 intel · $0.080/M · 59 t/s
  17. qwen3.5-plus-20260420
    Reasoning, Tools, JSON, +1 · intel not listed · $0.300/M · 51 t/s · 1.5s ttft
  18. qwen3.6-flash
    Reasoning, Tools, JSON, +1 · intel not listed · $0.188/M · 112 t/s · 727ms ttft
  19. qwen3.5-flash-02-23
    Reasoning, Tools, JSON, +1 · intel not listed · $0.065/M · 76 t/s · 607ms ttft
  20. qwen3.5-plus-02-15
    Reasoning, Tools, JSON, +1 · intel not listed · $0.260/M · 41 t/s · 1.6s ttft
  21. qwen2.5-vl-72b-instruct
    JSON, Vision · intel not listed · $0.800/M · 14 t/s · 1.2s ttft

Frequently asked

What is the best Qwen model for vision?

Qwen3.7 Plus is the best vision-capable Qwen model, pairing an intelligence score of 53.3 with image and document understanding. Qwen3.6 Plus (50.0) and Qwen3.5-122B-A10B (41.6) round out the top three.

What's a good alternative to Qwen3.7 Plus?

Qwen3.6 Plus (50.0) is the closest alternative on this metric, followed by Qwen3.5-122B-A10B (41.6). See the full ranking above for the tradeoffs.
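
One way to make those tradeoffs concrete is to re-sort the top three by something other than raw intelligence. The sketch below uses the intelligence, input price and speed figures from the ranking above. The `Model` class, the `rank_by` helper and the "intelligence per input dollar" ratio are illustrative assumptions, not metrics or code that modelgrep publishes.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Model:
    """One row of the ranking (hypothetical container, not a modelgrep type)."""
    name: str
    intel: float        # intelligence score as listed
    input_price: float  # USD per million input tokens as listed
    speed: float        # tokens per second as listed


# Figures copied from the top three rows of the ranking.
TOP_THREE = [
    Model("qwen3.7-plus", 53.3, 0.320, 27),
    Model("qwen3.6-plus", 50.0, 0.325, 36),
    Model("qwen3.5-122b-a10b", 41.6, 0.260, 88),
]


def intel_per_dollar(model: Model) -> float:
    """Illustrative ratio: intelligence points per USD of input price."""
    return model.intel / model.input_price


def rank_by(models: List[Model], key: Callable[[Model], float]) -> List[Model]:
    """Return the models sorted from highest to lowest on the given key."""
    return sorted(models, key=key, reverse=True)


if __name__ == "__main__":
    for m in rank_by(TOP_THREE, intel_per_dollar):
        print(f"{m.name}: {intel_per_dollar(m):.1f} intel per $/M, {m.speed:.0f} t/s")
```

On these three rows, qwen3.7-plus still leads when intelligence is divided by input price, while qwen3.5-122b-a10b is the fastest of the three at 88 t/s. Passing `key=lambda m: m.speed` to `rank_by` gives the speed ordering.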

How many Qwen models are there?

modelgrep tracks 49 Qwen models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Qwen3.7 Max. Of these, 21 qualify for this ranking.
