Command Palette
Search for a command to run...
Jonathan Ho Ajay Jain Pieter Abbeel

Abstract
We present high quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics. Our best results are obtained by training on a weighted variational bound designed according to a novel connection between diffusion probabilistic models and denoising score matching with Langevin dynamics, and our models naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding. On the unconditional CIFAR10 dataset, we obtain an Inception score of 9.46 and a state-of-the-art FID score of 3.17. On 256x256 LSUN, we obtain sample quality similar to ProgressiveGAN. Our implementation is available at https://github.com/hojonathanho/diffusion
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| density-estimation-on-cifar-10 | DDPM | NLL (bits/dim): 3.69 |
| image-generation-on-cifar-10 | Denoising Diffusion | FID: 3.17 |
| image-generation-on-imagenet-32x32 | DDPM | FID: 16.18 bpd: 3.89 |
| image-generation-on-lsun-bedroom-1 | Denoising Diffusion Probabilistic Model | FID-50k: 4.9 |
| image-generation-on-lsun-bedroom-256-x-256 | Denoising Diffusion Probabilistic Model (large) | FID: 4.9 |
| image-generation-on-lsun-bedroom-256-x-256 | Denoising Diffusion Probabilistic Model (large, DINOv2) | FD: 229.76 Precision: 0.79 Recall: 0.61 |
| image-generation-on-lsun-bedroom-256-x-256 | Denoising Diffusion Probabilistic Model | FID: 6.36 |
| image-generation-on-lsun-cat-256-x-256 | Denoising Diffusion Probabilistic Model | FID: 19.75 |
| image-generation-on-lsun-churches-256-x-256 | Denoising Diffusion Probabilistic Model | FID: 7.89 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.