NQ

reasoning official site →

Natural Questions (NQ) benchmark containing real user questions issued to Google search with answers found from Wikipedia, designed for training and evaluation of automatic question answering systems

Methodology

Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: general, reasoning, search. Language: en. Verified by llm-stats: no.

Leaderboard

  1. Granite 3.3 8B Base self-reported llm-stats
    36.5%