Magistral Small 2506

Magistral Small builds on Mistral Small 3.1 (2503) and adds reasoning capabilities through supervised fine-tuning (SFT) on Magistral Medium traces, followed by reinforcement learning (RL). It is a small, efficient reasoning model with 24B parameters. Once quantized, Magistral Small can be deployed locally, fitting on a single RTX 4090 or a MacBook with 32GB of RAM.
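
As a rough sanity check on the local-deployment claim, the sketch below estimates the memory taken by the weights alone at a few bit widths. The 24B parameter count comes from the description above. The bit widths are illustrative assumptions, since the source does not say which quantization scheme is meant, and the helper name `weight_memory_gb` is hypothetical. The estimate ignores the KV cache, activations, and per-block quantization metadata, so real usage will be higher.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 24e9  # parameter count stated in the model description

# Bit widths are assumptions for illustration, not the model's documented formats.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gb(N_PARAMS, bits):.0f} GB")
# prints:
# 16-bit: ~48 GB
# 8-bit: ~24 GB
# 4-bit: ~12 GB
```

An RTX 4090 has 24 GB of VRAM, so under these assumptions the 16-bit weights would not fit. A roughly 4-bit quantization leaves headroom for the cache and activations, on both the GPU and a 32GB MacBook.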

Benchmark results

Benchmark       Score    Tags            Source
AIME 2024       70.7%    self-reported   llm-stats
AIME 2025       62.8%    self-reported   llm-stats
GPQA            68.2%    self-reported   llm-stats
LiveCodeBench   51.3%    self-reported   llm-stats