Global PIQA
reasoning
Global PIQA is a multilingual commonsense reasoning benchmark that evaluates physical interaction knowledge across 100 languages and cultures. It tests whether AI systems understand the physical world in diverse cultural contexts, using multiple-choice questions about everyday situations that require physical commonsense.
Methodology
Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: general, physics, reasoning. Language: en. Verified by llm-stats: no.
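A maximum score of 1 is consistent with reporting accuracy as the fraction of questions answered correctly. The sketch below illustrates that reading only; the metadata does not confirm the scoring rule. The function name `accuracy`, the integer encoding of answer choices, and the sample data are hypothetical and are not taken from the benchmark or from llm-stats.

```python
from typing import Sequence


def accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Return the fraction of questions answered correctly, in [0, 1].

    Assumption: each question has one correct choice, encoded as an
    integer index, and the model's answer uses the same encoding.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("at least one question is required")
    # Count positions where the predicted choice matches the gold choice.
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


# Hypothetical example: four questions, three answered correctly.
labels = [0, 1, 1, 0]
predictions = [0, 1, 0, 0]
score = accuracy(predictions, labels)
print(score)  # 0.75
```

Under this assumption, a model that answers every question correctly scores 1, which matches the listed maximum.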