Creative Writing v3
creativity official site →
EQ-Bench Creative Writing v3 is an LLM-judged creative writing benchmark that evaluates models across 32 writing prompts with 3 iterations per prompt. Uses a hybrid scoring system combining rubric assessment and Elo ratings through pairwise comparisons. Challenges models in areas like humor, romance, spatial awareness, and unique perspectives to assess emotional intelligence and creative writing capabilities.
Methodology
Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: creativity, writing. Language: en. Verified by llm-stats: no.