Phi-3.5-mini-instruct

Phi-3.5-mini-instruct is a 3.8B-parameter model that supports a context length of up to 128K tokens and offers improved multilingual capabilities across more than 20 languages. It underwent additional training and safety post-training to improve instruction following, reasoning, math, and code generation.

Benchmark         Score
GSM8k             86.2%
ARC-C             84.6%
RULER             84.1%
PIQA              81.0%
OpenBookQA        79.2%
BoolQ             78.0%
RepoQA            77.0%
Social IQa        74.7%
MEGA XStoryCloze  73.5%
MBPP              69.6%
HellaSwag         69.4%
BIG-Bench Hard    69.0%
MMLU              69.0%
Winogrande        68.5%
TruthfulQA        64.0%
MEGA XCOPA        63.1%
HumanEval         62.8%
MEGA TyDi QA      62.2%
MEGA MLQA         61.7%
MMMLU             55.4%
MATH              48.5%
MGSM              47.9%
MMLU-Pro          47.4%
MEGA UDPOS        46.5%
Qasper            41.9%
Arena Hard        37.0%
GPQA              30.4%
GovReport         25.9%
SQuALITY          24.3%
QMSum             21.3%
SummScreenFD      16.0%