LLM Agent Leaderboard

LLM Agent Leaderboard — 10 items ranking table
RankNameDescription
🥇Claude Fable 5 (High)By Anthropic · net improvement 13.34%
🥈Claude Opus 4.8 (Thinking)By Anthropic · net improvement 9.37%
🥉GPT 5.5 (xHigh)By OpenAI · net improvement 8.21%
4.Claude Opus 4.7By Anthropic · net improvement 8.16%
5.Claude Opus 4.7 (Thinking)By Anthropic · net improvement 8.07%
6.GPT 5.5 (High)By OpenAI · net improvement 7.13%
7.GLM 5.2 (Max)By Zhipu AI · net improvement 6.93%
8.GPT 5.4 (High)By OpenAI · net improvement 6.65%
9.Claude Opus 4.6By Anthropic · net improvement 6.47%
10.GPT 5.5By OpenAI · net improvement 6.22%