EQ-Bench

reasoning

EQ-Bench is an LLM-judged test evaluating active emotional intelligence abilities, understanding, insight, empathy, and interpersonal skills. The test set contains 45 challenging roleplay scenarios, most of which constitute pre-written prompts spanning 3 turns. The benchmark evaluates the performance of models by validating responses against several criteria and conducts pairwise comparisons to report a normalized Elo computation for each model.

Leaderboard

Showing 2 of 2 results

Grok-4.1 Thinking

1,586

i
Grok-4.1

1,585

i