Hermes 4 405B is the best Nous model for agents, scoring 12.6 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. Hermes 3 405B Instruct (free) (11.8) and Hermes 3 405B Instruct (11.8) round out the top three.
AI models ranked by the Artificial Analysis Agentic Index — measuring multi-step tool use, planning and task completion (including Tau²-Bench). The best models for building autonomous agents and agentic workflows.
Hermes 4 405B is the best Nous model for agents, scoring 12.6 on the Artificial Analysis Agentic Index for tool use and multi-step task completion. Hermes 3 405B Instruct (free) (11.8) and Hermes 3 405B Instruct (11.8) round out the top three.
Hermes 3 405B Instruct (free) (11.8) is the closest alternative on this metric, followed by Hermes 3 405B Instruct (11.8). See the full ranking above for the tradeoffs.
modelgrep tracks 5 Nous models with live benchmarks, speed, latency and per-provider pricing, led on intelligence by Hermes 4 405B. 5 of them qualify for this ranking.