TheoremQA

math

A theorem-driven question-answering dataset containing 800 high-quality questions covering 350+ theorems from Math, Physics, EE&CS, and Finance. It is designed to evaluate AI models' ability to apply theorems to solve challenging university-level science problems.

Leaderboard

Qwen2 72B Instruct: 44.4%
Qwen2.5 32B Instruct: 44.1%
Qwen2.5-Coder 32B Instruct: 43.1%
Qwen2.5 14B Instruct: 43.0%
Qwen2.5-Coder 7B Instruct: 34.0%
Qwen2 7B Instruct: 25.3%