Gemini Diffusion

Gemini Diffusion is a state-of-the-art, experimental text diffusion model from Google DeepMind. It explores a new kind of language model designed to provide users with greater control, creativity, and speed in text generation. Instead of predicting text token-by-token, it learns to generate outputs by refining noise step-by-step, allowing for rapid iteration and error correction during generation. Key capabilities include rapid response times (reportedly 1479 tokens/sec excluding overhead), generation of more coherent text by outputting entire blocks of tokens at once, and iterative refinement for consistent outputs. It excels at tasks like editing, including in math and code contexts.

Benchmark results

Benchmark Score Tags Source
AIME 2025 23.3% self-reported llm-stats link →
BIG-Bench Extra Hard 15.0% self-reported llm-stats link →
BigCodeBench 45.4% self-reported llm-stats link →
Global-MMLU-Lite 69.1% self-reported llm-stats link →
GPQA 40.4% self-reported llm-stats link →
HumanEval 89.6% self-reported llm-stats link →
LBPP (v2) 56.8% self-reported llm-stats link →
LiveCodeBench 30.9% self-reported llm-stats link →
MBPP 76.0% self-reported llm-stats link →
SWE-Bench Verified 22.9% self-reported llm-stats link →