Llama 3.3 70B Instruct (free) is the best Meta model for agents, scoring 9.1 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. The Llama 3.3 70B Instruct listing without the free tag matches that score at 9.1, and Llama 4 Maverick (7.2) rounds out the top three.
This page ranks AI models by the Artificial Analysis Agentic Index, which measures multi-step tool use, planning and task completion (including Tau²-Bench). The top-ranked models are the best choices for building autonomous agents and agentic workflows.
Llama 3.3 70B Instruct (9.1) is the closest alternative on this metric, followed by Llama 4 Maverick (7.2). See the full ranking above for the tradeoffs.
modelgrep tracks 13 Meta models with live benchmarks, speed, latency and per-provider pricing; Llama 4 Maverick leads them on intelligence. Of the 13, 10 qualify for this ranking.