APEX-Agents

reasoning

APEX-Agents is a benchmark that evaluates AI agents on long-horizon professional tasks requiring sustained reasoning, planning, and execution across complex multi-step workflows.

Leaderboard

Gemini 3.1 Pro    33.5%
Kimi K2.6         27.9%
MiniMax M3        27.7%