Publication record · 18.cifr/2021.dhariwal.diffusion-classifier-guidance
We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For conditional image synthesis, we further improve sample quality with classifier guidance: a simple, compute-efficient method for trading off diversity for fidelity using gradients from a classifier. We achieve an FID of 2.97 on ImageNet 128x128, 4.59 on ImageNet 256x256, and 7.72 on ImageNet 512x512.
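The core of classifier guidance is a one-line change to the reverse diffusion step: shift the denoising Gaussian's mean in the direction of the classifier's score, scaled by the step variance and a guidance weight. A minimal sketch of that mean shift, assuming the classifier gradient has already been computed elsewhere (the function and variable names below are illustrative, not the paper's code):

```python
import numpy as np

def classifier_guided_mean(mu, sigma, grad_log_p, scale=1.0):
    """Shift the reverse-process mean toward higher classifier likelihood.

    mu         : unguided denoising mean for this timestep
    sigma      : per-step noise standard deviation
    grad_log_p : gradient of log p(class | noisy image) w.r.t. the image
    scale      : guidance weight; larger values trade diversity for fidelity
    """
    # Guided mean = mu + scale * sigma^2 * grad_log_p
    return mu + scale * (sigma ** 2) * grad_log_p

# Toy usage on a 4-dimensional "image":
mu = np.zeros(4)
grad = np.array([1.0, -1.0, 0.0, 2.0])
shifted = classifier_guided_mean(mu, sigma=0.5, grad_log_p=grad, scale=2.0)
print(shifted)  # [ 0.5 -0.5  0.   1. ]
```

In the full sampler this shift is applied at every denoising step, with `grad_log_p` obtained by backpropagating a classifier trained on noisy images; the `scale` knob is what controls the diversity-for-fidelity trade-off described above.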
Classifier guidance requires a separate noisy-image classifier, adding complexity; classifier-free guidance and joint training are natural successors. Sampling speed (many forward passes) remains a bottleneck; distillation and improved samplers are flagged as open problems. Multi-modal and text-conditional extensions are obvious next steps.
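Classifier-free guidance removes the separate classifier by training one model with and without conditioning and extrapolating between the two noise predictions at sampling time. A minimal sketch of that extrapolation, assuming the conditional and unconditional predictions are already available (names here are illustrative):

```python
import numpy as np

def cfg_epsilon(eps_uncond, eps_cond, w=1.5):
    """Classifier-free guidance on the model's noise prediction.

    eps_uncond : noise prediction with the condition dropped (null token)
    eps_cond   : noise prediction with the condition supplied
    w          : guidance weight; w=1 recovers the conditional prediction
    """
    # Extrapolate from the unconditional toward the conditional prediction.
    return eps_uncond + w * (eps_cond - eps_uncond)

# Toy usage: with w=2 the guided prediction overshoots the conditional one.
eps_u = np.array([0.0, 0.0])
eps_c = np.array([1.0, 2.0])
print(cfg_epsilon(eps_u, eps_c, w=2.0))  # [2. 4.]
```

Because both predictions come from the same network, this keeps the single-model training pipeline while exposing the same diversity-for-fidelity knob as classifier guidance.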