Kimi K2-Thinking-0905

Kimi K2 Thinking is the latest, most capable version of open-source thinking model. Starting with Kimi K2, it is built as a thinking agent that reasons step-by-step while dynamically invoking tools.

AIME 2025

100.0%

i
HMMT 2025

97.5%

i
MMLU-Redux

94.4%

i
FRAMES

87.0%

i
MMLU-Pro

84.6%

i
GPQA

84.5%

i
LiveCodeBench v6

83.1%

i
IMO-AnswerBench

78.6%

i
WritingBench

73.8%

i
SWE-Bench Verified

71.3%

i
BrowseComp-zh

62.3%

i
SWE-bench Multilingual

61.1%

i
BrowseComp

60.2%

i
HealthBench

58.0%

i
Seal-0

56.3%

i
Humanity's Last Exam

51.0%

i
OJBench

48.7%

i
FinSearchComp-T3

47.4%

i
Terminal-Bench

47.1%

i
SciCode

44.8%

i
Multi-SWE-Bench

41.9%

i