Mistral Large 3

Mistral Large 3 is a state-of-the-art general-purpose Multimodal granular Mixture-of-Experts model with 41B active parameters and 675B total parameters trained from scratch with 3000 H200s. This model is the instruct post-trained version, fine-tuned for instruction tasks, making it ideal for chat, agentic and instruction based use cases.

MATH

90.4%

i
MMLU-Redux

82.0%

i
TriviaQA

74.9%

i
MMMLU

74.2%

i
Wild Bench

68.5%

i
MATH (CoT)

67.6%

i
Arena Hard

55.1%

i
MM-MT-Bench

84.9

i