Claude Opus 4.1

Claude Opus 4.1 is a hybrid reasoning model that pushes the frontier for coding and AI agents, featuring a 200K context window. It delivers superior performance and precision for real-world coding and agentic tasks, handling complex multi-step problems with rigor and attention to detail.

MMMLU

89.5%

i
TAU-bench Retail

82.4%

i
GPQA

80.9%

i
AIME 2025

78.0%

i
MMMU (validation)

77.1%

i
SWE-Bench Verified

74.5%

i
TAU-bench Airline

56.0%

i
Terminal-Bench

43.3%

i