Kimi K2 Base

Kimi K2 base model is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained on 15.5 trillion tokens with the MuonClip optimizer, this is the foundation model before instruction tuning. It demonstrates strong performance on knowledge, reasoning, and coding benchmarks while being optimized for agentic capabilities.

Benchmark results

Benchmark Score Tags Source
C-Eval 92.5% self-reported llm-stats link →
CSimpleQA 77.6% self-reported llm-stats link →
EvalPlus 80.3% self-reported llm-stats link →
GPQA 48.1% self-reported llm-stats link →
GSM8k 92.1% self-reported llm-stats link →
LiveCodeBench v6 26.3% self-reported llm-stats link →
MATH 70.2% self-reported llm-stats link →
MMLU 87.8% self-reported llm-stats link →
MMLU-Pro 69.2% self-reported llm-stats link →
MMLU-redux-2.0 90.2% self-reported llm-stats link →
SimpleQA 35.3% self-reported llm-stats link →
SuperGPQA 44.7% self-reported llm-stats link →
TriviaQA 85.1% self-reported llm-stats link →