QVHighlights

multimodal

QVHighlights is a video moment retrieval benchmark for detecting moments and highlights in videos from natural language queries. Given a query, the model must localize the start and end times of the relevant moments in the video. Predictions are evaluated with metrics such as Recall@1 at a temporal IoU threshold of 0.5.
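To make the metric concrete, the following is a minimal sketch of how Recall@1 at an IoU threshold of 0.5 could be computed. It is not the benchmark's official evaluation script. It assumes one ground-truth span per query and one top-ranked predicted span per query, and the names `Span`, `temporal_iou`, and `recall_at_1` are hypothetical helpers introduced only for illustration.

```python
from typing import List, Tuple

# Hypothetical type for illustration: a moment as (start_sec, end_sec).
Span = Tuple[float, float]


def temporal_iou(pred: Span, gt: Span) -> float:
    """Intersection over union of two time intervals."""
    intersection = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - intersection
    return intersection / union if union > 0 else 0.0


def recall_at_1(
    top1_preds: List[Span],
    ground_truths: List[Span],
    iou_threshold: float = 0.5,
) -> float:
    """Fraction of queries whose top-ranked span reaches the IoU threshold.

    Assumption: exactly one ground-truth span per query.
    """
    if len(top1_preds) != len(ground_truths):
        raise ValueError("Need one prediction per ground-truth span.")
    if not ground_truths:
        return 0.0
    hits = sum(
        temporal_iou(pred, gt) >= iou_threshold
        for pred, gt in zip(top1_preds, ground_truths)
    )
    return hits / len(ground_truths)


if __name__ == "__main__":
    preds = [(10.0, 20.0), (0.0, 5.0)]
    gts = [(12.0, 22.0), (30.0, 40.0)]
    # First query overlaps with IoU 8/12; second has no overlap.
    print(recall_at_1(preds, gts))  # prints 0.5
```

Under these assumptions, a score such as 77.2% would mean that the top-ranked span reached an IoU of at least 0.5 with the ground truth for 77.2% of queries. Whether the leaderboard figures use exactly this metric is not stated on this page.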

Leaderboard

Nova 2 Lite: 77.2%
Nova 2 Omni: 76.7%
Nova 2 Pro: 76.7%