Kimi K2 Base

Kimi K2 base model is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained on 15.5 trillion tokens with the MuonClip optimizer, this is the foundation model before instruction tuning.

C-Eval

92.5%

i
GSM8k

92.1%

i
MMLU-redux-2.0

90.2%

i
MMLU

87.8%

i
TriviaQA

85.1%

i
EvalPlus

80.3%

i
CSimpleQA

77.6%

i
MATH

70.2%

i
MMLU-Pro

69.2%

i
GPQA

48.1%

i
SuperGPQA

44.7%

i
SimpleQA

35.3%

i
LiveCodeBench v6

26.3%

i