GPT-5.5 is the best OpenAI model for agents, scoring 69.4 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. GPT-5.4 (68.0) and GPT-5.3-Codex (60.5) round out the top three.
AI models ranked by the Artificial Analysis Agentic Index — measuring multi-step tool use, planning and task completion (including Tau²-Bench). The best models for building autonomous agents and agentic workflows.
GPT-5.5 is the best OpenAI model for agents, scoring 69.4 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. GPT-5.4 (68.0) and GPT-5.3-Codex (60.5) round out the top three.
GPT-5.4 (68.0) is the closest alternative on this metric, followed by GPT-5.3-Codex (60.5). See the full ranking above for the tradeoffs.
modelgrep tracks 62 OpenAI models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by GPT-5.4. 25 of them qualify for this ranking.