Global PIQA

reasoning

Global PIQA is a multilingual commonsense reasoning benchmark that evaluates physical interaction knowledge across 100 languages and cultures. It uses multiple-choice questions about everyday situations to test whether AI systems understand the physical world in diverse cultural contexts.
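To make the percentage scores concrete, here is a minimal sketch of how a multiple-choice benchmark of this kind could be scored. It assumes each item has a prompt, a list of candidate answers, and the index of the correct one, and that the reported score is plain accuracy; the `Item` type, the `accuracy` function, the `always_first` baseline, and the two toy items are all hypothetical and not taken from the benchmark itself.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Item:
    """One multiple-choice question (hypothetical schema, for illustration only)."""
    prompt: str
    choices: tuple[str, ...]
    label: int  # index of the correct choice


# A "chooser" maps (prompt, choices) to the index of the answer it picks.
Chooser = Callable[[str, Sequence[str]], int]


def accuracy(items: Sequence[Item], choose: Chooser) -> float:
    """Return the percentage of items for which the chooser picks the labeled answer."""
    if not items:
        raise ValueError("cannot score an empty item list")
    correct = sum(1 for item in items if choose(item.prompt, item.choices) == item.label)
    return 100.0 * correct / len(items)


def always_first(prompt: str, choices: Sequence[str]) -> int:
    """Trivial baseline that always picks the first choice."""
    return 0


# Toy items invented for this sketch; they are not from the benchmark.
toy_items = [
    Item("To keep soup warm for longer,", ("cover the pot with a lid.", "leave the pot uncovered."), 0),
    Item("To dry wet shoes faster,", ("seal them in a plastic bag.", "stuff them with dry newspaper."), 1),
]

score = accuracy(toy_items, always_first)
```

Under these assumptions, a chooser that picks the same position every time scores near chance on a balanced two-choice set, which is a useful reference point when reading the smaller models' results.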

Leaderboard


Model                Score
Gemini 3 Pro         93.4%
Gemini 3 Flash       92.8%
Qwen3.7 Max          91.4%
Qwen3.5-397B-A17B    89.8%
Qwen3.6 Plus         89.8%
Qwen3.5-122B-A10B    88.4%
Qwen3.5-27B          87.5%
Qwen3.5-35B-A3B      86.6%
Qwen3.5-9B           83.2%
Qwen3.5-4B           78.9%
Qwen3.5-2B           69.3%
Qwen3.5-0.8B         59.4%