Claude 3 Sonnet
Claude 3 Sonnet strikes the ideal balance between intelligence and speed—particularly for enterprise workloads. It delivers strong performance at a lower cost compared to its peers, and is engineered for high endurance in large-scale AI deployments.
Benchmark results
| Benchmark | Score | Tags | Source |
|---|---|---|---|
| ARC-C | 93.2% | self-reported llm-stats | link → |
| BIG-Bench Hard | 82.9% | self-reported llm-stats | link → |
| DROP | 78.9% | self-reported llm-stats | link → |
| GPQA | 40.4% | self-reported llm-stats | link → |
| GSM8k | 92.3% | self-reported llm-stats | link → |
| HellaSwag | 89.0% | self-reported llm-stats | link → |
| HumanEval | 73.0% | self-reported llm-stats | link → |
| MATH | 43.1% | self-reported llm-stats | link → |
| MGSM | 83.5% | self-reported llm-stats | link → |
| MMLU | 79.0% | self-reported llm-stats | link → |
| MMLU-Pro | 56.8% | self-reported llm-stats | link → |