Qwen3.7 Max is the best Qwen model for agents, scoring 66.6 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. Qwen3.7 Plus (65.1) and Qwen3.6 Max Preview (64.8) round out the top three.
AI models ranked by the Artificial Analysis Agentic Index — measuring multi-step tool use, planning and task completion (including Tau²-Bench). The best models for building autonomous agents and agentic workflows.
Qwen3.7 Max is the best Qwen model for agents, scoring 66.6 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. Qwen3.7 Plus (65.1) and Qwen3.6 Max Preview (64.8) round out the top three.
Qwen3.7 Plus (65.1) is the closest alternative on this metric, followed by Qwen3.6 Max Preview (64.8). See the full ranking above for the tradeoffs.
modelgrep tracks 49 Qwen models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Qwen3.7 Max. 25 of them qualify for this ranking.