AutomationBench

reasoning

AutomationBench is a tool-use benchmark that evaluates how well AI agents automate real-world workflows by orchestrating tools to complete multi-step tasks.

Leaderboard

Claude Fable 5: 17.4%
