Claude Fable 5 is the best LLM for agents, scoring 80.6 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. Claude Opus 4.8 (77.8) and Claude Opus 4.7 (71.3) round out the top three.
AI models ranked by the Artificial Analysis Agentic Index — measuring multi-step tool use, planning and task completion (including Tau²-Bench). The best models for building autonomous agents and agentic workflows.
Claude Fable 5 is the best LLM for agents, scoring 80.6 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. Claude Opus 4.8 (77.8) and Claude Opus 4.7 (71.3) round out the top three.
Claude Opus 4.8 (77.8) is the closest alternative on this metric, followed by Claude Opus 4.7 (71.3). See the full ranking above for the tradeoffs.