Phi-3.5-MoE-instruct

Phi-3.5-MoE-instruct is a mixture-of-experts model with about 42B total parameters, of which 6.6B are active per token, and a 128K-token context window. It is strong at reasoning, math, coding, and multilingual tasks, and it outperforms larger dense models on many benchmarks.
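
To make the total-versus-active distinction concrete, here is a minimal sketch that computes the share of parameters used per token from the two figures quoted above. The constant and function names are illustrative, not part of any library, and the figures are approximate. As a rough rule, per-token compute follows the active count, while memory still has to hold the full model.

```python
# Approximate figures from the model description, in billions of parameters.
TOTAL_PARAMS_B = 42.0
ACTIVE_PARAMS_B = 6.6


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of parameters that take part in processing one token."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"Active share per token: {fraction:.1%}")  # prints "Active share per token: 15.7%"
```

Roughly one parameter in six is exercised for any given token, which is one reason a model of this size can be compared with smaller dense models on inference cost.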

Benchmark          Score
ARC-C              91.0%
OpenBookQA         89.6%
GSM8k              88.7%
PIQA               88.6%
RULER              87.1%
RepoQA             85.0%
BoolQ              84.6%
HellaSwag          83.8%
MEGA XStoryCloze   82.8%
Winogrande         81.3%
MBPP               80.8%
BIG-Bench Hard     79.1%
MMLU               78.9%
Social IQa         78.0%
TruthfulQA         77.5%
MEGA XCOPA         76.6%
HumanEval          70.7%
MMMLU              69.9%
MEGA TyDi QA       67.1%
MEGA MLQA          65.3%
MEGA UDPOS         60.4%
MATH               59.5%
MGSM               58.7%
MMLU-Pro           45.3%
Qasper             40.0%
Arena Hard         37.9%
GPQA               36.8%
GovReport          26.4%
SQuALITY           24.1%
QMSum              19.9%
SummScreenFD       16.9%