OpenBookQA

reasoning

OpenBookQA is a question-answering dataset modeled after open book exams for assessing human understanding. It contains 5,957 multiple-choice elementary-level science questions that probe understanding of 1,326 core science facts and their application to novel situations, requiring combination of open book facts with broad common knowledge through multi-hop reasoning.

Leaderboard

Showing 5 of 5 results

Phi-3.5-MoE-instruct

89.6%

i
Phi-3.5-mini-instruct

79.2%

i
Phi 4 Mini

79.2%

i
Mistral NeMo Instruct

60.6%

i
Hermes 3 70B

49.4%

i