Qwen3-235B-A22B-Instruct-2507

Qwen3-235B-A22B-Instruct-2507 is the updated instruct version of Qwen3-235B-A22B featuring significant improvements in general capabilities including instruction following, logical reasoning, text comprehension, mathematics, science, coding and tool usage. It provides substantial gains in long-tail knowledge coverage across multiple languages and markedly better alignment with user preferences in subjective and open-ended tasks.

ZebraLogic

95.0%

i
MMLU-Redux

93.1%

i
IFEval

88.7%

i
MultiPL-E

87.9%

i
WritingBench

85.2%

i
CSimpleQA

84.3%

i
MMLU-Pro

83.0%

i
Include

79.5%

i
MMLU-ProX

79.4%

i
Arena-Hard v2

79.2%

i
GPQA

77.5%

i
Multi-IF

77.5%

i
LiveBench 20241125

75.4%

i
Tau2 Retail

71.3%

i
BFCL-v3

70.9%

i
AIME 2025

70.3%

i
SuperGPQA

62.6%

i
Aider-Polyglot

57.3%

i
HMMT25

55.4%

i
SimpleQA

54.3%

i
LiveCodeBench v6

51.8%

i
PolyMATH

50.2%

i
Tau2 Airline

44.0%

i
ARC-AGI

41.8%

i
Creative Writing v3

0.875

i