LongCat-Flash-Thinking-2601

LongCat-Flash-Thinking-2601 is an upgraded version of LongCat-Flash-Thinking with 560B total parameters (MoE, ~27B activated). It achieves open-source SOTA performance on core evaluation benchmarks including Agentic Search, Agentic Tool Use, and Tool-Integrated Reasoning (TIR).

AIME 2025

99.6%

i
Tau2 Telecom

99.3%

i
Tau2 Retail

88.6%

i
LiveCodeBench

82.8%

i
GPQA

80.5%

i
IMO-AnswerBench

78.6%

i
Tau2 Airline

76.5%

i
SWE-Bench Verified

70.0%

i
BrowseComp-zh

69.0%

i
BrowseComp

56.6%

i
Humanity's Last Exam

25.2%

i