MLE-Bench Lite

coding

MLE-Bench Lite is a lightweight benchmark that evaluates AI agents on machine learning engineering tasks. It tests their ability to build, train, and optimize ML models for Kaggle-style competitions.

Leaderboard

MiniMax M2.7: 66.6%
