Claude Opus 4

Claude Opus 4 is Anthropic's most powerful model and the world's best coding model, part of the Claude 4 family. It delivers sustained performance on complex, long-running tasks and agent workflows.

MMMLU

88.8%

i
TAU-bench Retail

81.4%

i
GPQA

79.6%

i
MMMU (validation)

76.5%

i
AIME 2025

75.5%

i
SWE-Bench Verified

72.5%

i
TAU-bench Airline

59.6%

i
Terminal-Bench

39.2%

i
ARC-AGI v2

8.6%

i