Claude 3 Sonnet

Claude 3 Sonnet strikes the ideal balance between intelligence and speed—particularly for enterprise workloads. It delivers strong performance at a lower cost compared to its peers, and is engineered for high endurance in large-scale AI deployments.

Benchmark results

Benchmark Score Tags Source
ARC-C 93.2% self-reported llm-stats link →
BIG-Bench Hard 82.9% self-reported llm-stats link →
DROP 78.9% self-reported llm-stats link →
GPQA 40.4% self-reported llm-stats link →
GSM8k 92.3% self-reported llm-stats link →
HellaSwag 89.0% self-reported llm-stats link →
HumanEval 73.0% self-reported llm-stats link →
MATH 43.1% self-reported llm-stats link →
MGSM 83.5% self-reported llm-stats link →
MMLU 79.0% self-reported llm-stats link →
MMLU-Pro 56.8% self-reported llm-stats link →