modelgrep

Best MiniMax Vision Models

Quick answer · Updated June 2026

MiniMax M3 is the best vision-capable MiniMax model, pairing 54.7 intelligence with image and document understanding. MiniMax-01 (—) is next.

54.7Intelligence
42 t/sSpeed
$0.300Input /M
1.0MContext

Multimodal large language models that accept image input, ranked by intelligence. The best vision-capable AI models for understanding images, documents and charts.

  1. 1M
    minimax-m3
    ReasoningToolsJSON+154.7 intel · $0.300/M · 42 t/s
    54.7
    Intelligence
  2. 2M
    minimax-01
    Vision$0.200/M · 34 t/s · 823ms ttft
    Intelligence

Frequently asked

What is the best MiniMax model for vision?

MiniMax M3 is the best vision-capable MiniMax model, pairing 54.7 intelligence with image and document understanding. MiniMax-01 (—) is next.

What's a good alternative to MiniMax M3?

MiniMax-01 (—) is the closest alternative on this metric. See the full ranking above for the tradeoffs.

How many MiniMax models are there?

modelgrep tracks 8 MiniMax models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by MiniMax M3. 2 of them qualify for this ranking.

More MiniMax rankings

All rankings