GDPval-AA
reasoning
GDPval-AA is an evaluation of AI model performance on economically valuable knowledge work tasks across professional domains including finance, legal, and other sectors. Run independently by Artificial Analysis, it uses Elo scoring to rank models on real-world work task performance.
Methodology
Imported from llm-stats public benchmark metadata. Modality: text. Max score: 3000. Categories: agents, finance, general, legal, reasoning. Language: en. Verified by llm-stats: no.