QwQ-32B-Preview

An experimental research model focused on advancing AI reasoning capabilities, particularly excelling in mathematics and programming. Features deep introspection and self-questioning abilities while having some limitations in language mixing and recursive reasoning patterns.

Benchmark results

Benchmark Score Tags Source
AIME 2024 50.0% self-reported llm-stats link →
GPQA 65.2% self-reported llm-stats link →
LiveCodeBench 50.0% self-reported llm-stats link →
MATH-500 90.6% self-reported llm-stats link →