Social IQa
reasoning official site →
The first large-scale benchmark for commonsense reasoning about social situations. Contains 38,000 multiple choice questions probing emotional and social intelligence in everyday situations, testing commonsense understanding of social interactions and theory of mind reasoning about the implied emotions and behavior of others.
Methodology
Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: creativity, psychology, reasoning. Language: en. Verified by llm-stats: no.