GPT-4

GPT-4 is a large multimodal model that accepts both image and text inputs and generates human-like text outputs. It demonstrates human-level performance on a range of professional and academic benchmarks.

Benchmark                        Score
AI2 Reasoning Challenge (ARC)    96.3%
HellaSwag                        95.3%
Uniform Bar Exam                 90.0%
SAT Math                         89.0%
LSAT                             88.0%
Winogrande                       87.5%
MMLU                             86.4%
DROP                             80.9%
MGSM                             74.5%
HumanEval                        67.0%
MATH                             42.0%
GPQA                             35.7%