MEWC

reasoning

MEWC is a benchmark of multi-environment web challenges. It tests AI agents' ability to navigate and complete complex tasks across diverse web environments.

Leaderboard

MiniMax M2.5: 74.4%
