DRACO

reasoning

DRACO is a deep research benchmark that evaluates an agent's ability to gather, synthesize, and reason over information to answer complex research questions. Scores are based on official rubrics per question, with the final score being the average across all questions.

Leaderboard

Showing 1 of 1 result

MiniMax M3

73.2%

i