modelgrep

Best Meta Models for Agents

Quick answer · Updated June 2026

Llama 3.3 70B Instruct (free) is the best Meta model for agents, scoring 9.1 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. The paid Llama 3.3 70B Instruct variant (also 9.1) and Llama 4 Maverick (7.2) round out the top three.

9.1 Agentic
14.5 Intelligence
115 t/s Speed
Free Input /M
131K Context

AI models ranked by the Artificial Analysis Agentic Index, which measures multi-step tool use, planning and task completion (including Tau²-Bench). The ranking highlights the best models for building autonomous agents and agentic workflows.

  1. llama-3.3-70b-instruct:free
    Tools · 14.5 intel · Free/M · 115 t/s
    9.1 Agentic
  2. llama-3.3-70b-instruct
    Tools, JSON · 14.5 intel · $0.100/M · 115 t/s
    9.1 Agentic
  3. llama-4-maverick
    Tools, JSON, Vision · 18.4 intel · $0.150/M · 72 t/s
    7.2 Agentic
  4. llama-3.1-8b-instruct
    Tools, JSON · 11.8 intel · $0.020/M · 145 t/s
    5.5 Agentic
  5. llama-4-scout
    Tools, JSON, Vision · 13.5 intel · $0.100/M · 130 t/s
    5.2 Agentic
  6. llama-3.1-70b-instruct
    Tools, JSON · 12.5 intel · $0.400/M · 28 t/s
    5.1 Agentic
  7. llama-3.2-11b-vision-instruct
    JSON, Vision · 8.7 intel · $0.345/M · 35 t/s
    4.9 Agentic
  8. llama-3.2-1b-instruct
    6.3 intel · $0.027/M · 169 t/s
    0.0 Agentic
  9. llama-3-8b-instruct
    6.4 intel · $0.140/M · 63 t/s
    0.0 Agentic
  10. llama-3-70b-instruct
    JSON · 8.9 intel · $0.510/M · 18 t/s
    0.0 Agentic
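
For readers who want to reproduce a shortlist like the one above, here is a minimal Python sketch. It copies the agentic scores and input prices of the top seven entries by hand and sorts them by agentic score. Three things are assumptions for illustration, not documented modelgrep behavior: the `Model` class and `rank_for_agents` function are hypothetical names, the "Tools" badge is taken to mean tool-calling support, and ties are broken by lower input price.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    agentic: float      # Artificial Analysis Agentic Index score from the list
    price_per_m: float  # USD per million input tokens; 0.0 means free
    tools: bool         # assumption: True when the entry shows a "Tools" badge


# Values copied from the ranking above (top seven entries only).
MODELS = [
    Model("llama-3.3-70b-instruct:free", 9.1, 0.0, True),
    Model("llama-3.3-70b-instruct", 9.1, 0.100, True),
    Model("llama-4-maverick", 7.2, 0.150, True),
    Model("llama-3.1-8b-instruct", 5.5, 0.020, True),
    Model("llama-4-scout", 5.2, 0.100, True),
    Model("llama-3.1-70b-instruct", 5.1, 0.400, True),
    Model("llama-3.2-11b-vision-instruct", 4.9, 0.345, False),
]


def rank_for_agents(models, require_tools=True):
    """Sort by agentic score (descending); break ties by lower input price.

    The price tie-break is an assumption chosen for this sketch.
    """
    candidates = [m for m in models if m.tools or not require_tools]
    return sorted(candidates, key=lambda m: (-m.agentic, m.price_per_m))


ranked = rank_for_agents(MODELS)
for position, model in enumerate(ranked, start=1):
    print(position, model.name, model.agentic)
```

Passing `require_tools=False` keeps models without the Tools badge in the result, which is useful when the agent framework handles tool calls through structured JSON output instead.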

Frequently asked

What is the best Meta model for agents?

Llama 3.3 70B Instruct (free) is the best Meta model for agents, scoring 9.1 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. The paid Llama 3.3 70B Instruct variant (also 9.1) and Llama 4 Maverick (7.2) round out the top three.

What's a good alternative to Llama 3.3 70B Instruct (free)?

The paid Llama 3.3 70B Instruct variant (9.1, $0.100/M) is the closest alternative on this metric, followed by Llama 4 Maverick (7.2). See the full ranking above for the tradeoffs.

How many Meta models are there?

modelgrep tracks 13 Meta models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Llama 4 Maverick. 10 of them qualify for this ranking.
