Sarvam-105B

Sarvam-105B is Sarvam AI's flagship open-source reasoning model, a Mixture-of-Experts (MoE) architecture built for complex reasoning, coding, and agentic workflows. It uses 128 sparse experts with Multi-head Latent Attention (MLA) for efficient long-context inference and was pre-trained on 12 trillion tokens spanning code, mathematics, multilingual text, and web data.
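To make "sparse experts" concrete, here is a minimal sketch of top-k expert routing for a single token, in plain Python. Only the expert count (128) comes from the description above. The number of active experts per token (`TOP_K = 8`), the choice to apply softmax over the selected logits only, and the toy scalar experts are all assumptions made for illustration. This is not Sarvam-105B's actual router.

```python
import math
import random

NUM_EXPERTS = 128  # from the model description
TOP_K = 8          # assumption: the source does not state how many experts are active per token


def route(router_logits, top_k=TOP_K):
    """Return [(expert_index, weight)] for the top_k highest-scoring experts."""
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    top = ranked[:top_k]
    # Softmax over the selected logits only, so the weights sum to 1.
    # (Assumption: real routers may normalize differently.)
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_forward(x, experts, router_logits, top_k=TOP_K):
    """Weighted sum of the outputs of the selected experts for one token."""
    return sum(w * experts[i](x) for i, w in route(router_logits, top_k))


# Toy experts (hypothetical): expert i multiplies its scalar input by i + 1.
experts = [lambda x, s=i: x * (s + 1) for i in range(NUM_EXPERTS)]

rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
selected = route(logits)
print(len(selected), "of", NUM_EXPERTS, "experts active for this token")
```

Only the selected experts run for a given token, so per-token compute scales with `TOP_K` rather than with the total number of experts, which is the efficiency argument behind sparse MoE designs.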

MATH-500: 98.6%
AIME 2025: 96.7%
MMLU: 90.6%
HMMT 2025: 85.8%
IFEval: 84.8%
MMLU-Pro: 81.7%
GPQA: 78.7%
LiveCodeBench v6: 71.7%
Arena-Hard v2: 71.0%
Beyond AIME: 69.1%
BrowseComp: 49.5%
SWE-Bench Verified: 45.0%
Humanity's Last Exam: 11.2%