MAI-Thinking-1

MAI-Thinking-1 is Microsoft AI's first in-house reasoning model: a sparse Mixture-of-Experts model with 35B active and roughly 1T total parameters, with MAI-Base-1 as its base model, trained from scratch without distillation from third-party models. Built with Microsoft's Hill-Climbing Machine pipeline, it was pre-trained on 30T tokens of clean, commercially licensed, human-generated data, followed by 3.55T mid-training tokens. It was then post-trained with reinforcement learning across STEM, agentic coding, and helpfulness/safety specialists, which were consolidated into a single model.
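As a rough illustration of what the figures in the description imply, the sketch below computes the share of parameters active per token and the combined pre-training plus mid-training token count. The constant and function names are hypothetical, chosen only for this example, and because the total parameter count is stated only approximately (~1T), the derived fraction is approximate as well.

```python
# Back-of-the-envelope arithmetic using only the figures stated in the description.
# "~1T" is approximate, so every number derived from it is approximate too.
ACTIVE_PARAMS = 35e9        # stated: 35B active parameters
TOTAL_PARAMS = 1e12         # stated: roughly 1T total parameters (assumed exactly 1T here)
PRETRAIN_TOKENS = 30e12     # stated: 30T pre-training tokens
MIDTRAIN_TOKENS = 3.55e12   # stated: 3.55T mid-training tokens


def active_fraction(active: float, total: float) -> float:
    """Share of the model's parameters that are active for a single token."""
    return active / total


def total_training_tokens(pretrain: float, midtrain: float) -> float:
    """Pre-training plus mid-training tokens (post-training data is not counted)."""
    return pretrain + midtrain


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    tokens = total_training_tokens(PRETRAIN_TOKENS, MIDTRAIN_TOKENS)
    print(f"active fraction: {frac:.1%}")
    print(f"pre-training + mid-training tokens: {tokens / 1e12:.2f}T")
```

Under these assumptions, about 3.5% of the parameters are active for any given token, and the model saw about 33.55T tokens before post-training.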

Benchmark                  Score
LongFact                   98.0%
AIME 2025                  97.0%
AIME 2026                  94.5%
GraphWalks                 90.0%
AIR-Bench                  88.0%
TruthfulQA                 88.0%
LiveCodeBench v6           87.7%
AdvancedIF                 85.0%
MMLU-Pro                   85.0%
HMMT Feb 26                84.9%
GPQA                       84.2%
CorpusQA                   82.0%
SWE-Bench Verified         73.5%
BFCL-v3                    72.0%
IFBench                    69.0%
CyberSecEval 4             63.0%
LongBench v2               61.0%
Multi-Challenge            53.0%
SWE-Bench Pro              52.8%
Terminal-Bench 2.0         46.0%
MedXpertQA                 43.0%
HealthBench Professional   35.0%
SimpleQA Verified          31.0%