DeepSeek-V4-Flash-Max

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale.

CodeForces

100.0%

i
HMMT Feb 26

94.8%

i
LiveCodeBench

91.6%

i
IMO-AnswerBench

88.4%

i
GPQA

88.1%

i
MMLU-Pro

86.2%

i
MathArena Apex

85.7%

i
SWE-Bench Verified

79.0%

i
CSimpleQA

78.9%

i
MRCR 1M

78.7%

i
SWE-bench Multilingual

73.3%

i
BrowseComp

73.2%

i
MCP Atlas

69.0%

i
CorpusQA 1M

60.5%

i
Terminal-Bench 2.0

56.9%

i
SWE-Bench Pro

52.6%

i
Toolathlon

47.8%

i
Humanity's Last Exam

45.1%

i
SimpleQA

34.1%

i
GDPval-AA

1,395

i