GroundUI-1K

multimodal

A subset of GroundUI-18K for UI grounding evaluation, where models must predict action coordinates on screenshots based on single-step instructions across web, desktop, and mobile platforms.
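
As a rough illustration of how coordinate predictions of this kind might be scored, the sketch below assumes each sample carries a ground-truth bounding box for the target element and counts a prediction as correct when the predicted point falls inside that box. The names `BBox` and `grounding_accuracy` are hypothetical, and the benchmark's actual scoring rule may differ.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class BBox:
    """Hypothetical ground-truth box for a target UI element, in pixels."""
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, x: float, y: float) -> bool:
        # Edges are treated as inside the box (an assumption of this sketch).
        return self.left <= x <= self.right and self.top <= y <= self.bottom


def grounding_accuracy(
    predictions: Iterable[Tuple[float, float]],
    targets: Iterable[BBox],
) -> float:
    """Fraction of predicted (x, y) points that land inside their target box."""
    preds = list(predictions)
    boxes = list(targets)
    if len(preds) != len(boxes):
        raise ValueError("predictions and targets must have the same length")
    if not preds:
        return 0.0
    hits = sum(box.contains(x, y) for (x, y), box in zip(preds, boxes))
    return hits / len(preds)
```

Under this assumption, a score such as 81.4% would mean that roughly four out of five predicted points landed inside the target element.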

Leaderboard

Nova Pro: 81.4%
Nova Lite: 80.2%