ResearchClawBench

ResearchClawBench evaluates research agents on realistic research tasks that require tool use, code execution, and interaction with a filesystem workspace.

Leaderboard
