The best LLM for reasoning is Claude Fable 5 — 64.9 intelligence with extended step-by-step thinking. Claude Opus 4.8 (61.4) and Claude Opus 4.7 (57.3) round out the top three.
Large language models with extended reasoning / thinking, ranked by intelligence. The best models for complex multi-step reasoning, math and analysis.
The best LLM for reasoning is Claude Fable 5 — 64.9 intelligence with extended step-by-step thinking. Claude Opus 4.8 (61.4) and Claude Opus 4.7 (57.3) round out the top three.
Claude Opus 4.8 (61.4) is the closest alternative on this metric, followed by Claude Opus 4.7 (57.3). See the full ranking above for the tradeoffs.