Gemini Diffusion

Our state-of-the-art, experimental text diffusion model

Large language models are the foundation of generative AI today. We’re using a technique called diffusion to explore a new kind of language model that gives users greater control, creativity, and speed in text generation.

Rapid response

Generates content significantly faster than even our fastest model so far.

More coherent text

Generates entire blocks of tokens at once, meaning it responds more coherently to a user’s prompt than autoregressive models.

Iterative refinement

Corrects errors during generation for more consistent outputs.

Category	Benchmark	Gemini Diffusion	Gemini 2.0 Flash-Lite
Code	LiveCodeBench (v6)	30.9%	28.5%
Code	BigCodeBench	45.4%	45.8%
Code	LBPP (v2)	56.8%	56.0%
Code	SWE-Bench Verified*	22.9%	28.5%
Code	HumanEval	89.6%	90.2%
Code	MBPP	76.0%	75.8%
Science	GPQA Diamond	40.4%	56.5%
Mathematics	AIME 2025	23.3%	20.0%
Reasoning	BIG-Bench Extra Hard	15.0%	21.0%
Multilingual	Global MMLU (Lite)	69.1%	79.0%

Sampling speed (excluding overhead): 1,479 tokens/sec

Overhead: 0.84 sec
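The sampling speed and overhead figures above can be combined into a rough end-to-end latency estimate. The page does not say how the two numbers interact, so the sketch below assumes a simple additive model: a fixed overhead plus the token count divided by the sampling speed. The function names are hypothetical and are not part of any Gemini API. Under that assumption, a 1,000-token response would take roughly 1.5 seconds.

```python
# Figures reported on this page.
SAMPLING_SPEED_TOKENS_PER_SEC = 1479  # sampling speed, excluding overhead
OVERHEAD_SEC = 0.84                   # fixed overhead


def estimated_latency_sec(num_tokens: int) -> float:
    """Estimate total response time for num_tokens generated tokens.

    Assumption (not stated in the source): latency is the fixed overhead
    plus num_tokens divided by the sampling speed.
    """
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    return OVERHEAD_SEC + num_tokens / SAMPLING_SPEED_TOKENS_PER_SEC


def effective_tokens_per_sec(num_tokens: int) -> float:
    """Throughput once the overhead is included, under the same assumption."""
    return num_tokens / estimated_latency_sec(num_tokens)


if __name__ == "__main__":
    for n in (100, 1000, 10000):
        print(n, round(estimated_latency_sec(n), 2), round(effective_tokens_per_sec(n)))
```

Under this model the fixed overhead dominates short responses, so the effective throughput only approaches the quoted sampling speed for long outputs.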

Gemini Diffusion is currently available as an experimental demo to help develop and refine future models.