MM IF-Eval

reasoning

A challenging multimodal instruction-following benchmark that includes both compose-level constraints on output responses and perception-level constraints tied to the input images, along with a comprehensive evaluation pipeline.

Leaderboard


Pixtral-12B: 52.7%
