HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Diffusion Models With Learned Adaptive Noise

Subham Sekhar Sahoo Aaron Gokaslan Chris De Sa Volodymyr Kuleshov

Diffusion Models With Learned Adaptive Noise

Abstract

Diffusion models have gained traction as powerful algorithms for synthesizing high-quality images. Central to these algorithms is the diffusion process, a set of equations which maps data to noise in a way that can significantly affect performance. In this paper, we explore whether the diffusion process can be learned from data. Our work is grounded in Bayesian inference and seeks to improve log-likelihood estimation by casting the learned diffusion process as an approximate variational posterior that yields a tighter lower bound (ELBO) on the likelihood. A widely held assumption is that the ELBO is invariant to the noise process: our work dispels this assumption and proposes multivariate learned adaptive noise (MULAN), a learned diffusion process that applies noise at different rates across an image. Specifically, our method relies on a multivariate noise schedule that is a function of the data to ensure that the ELBO is no longer invariant to the choice of the noise schedule as in previous works. Empirically, MULAN sets a new state-of-the-art in density estimation on CIFAR-10 and ImageNet and reduces the number of training steps by 50%. We provide the code, along with a blog post and video tutorial on the project page: https://s-sahoo.com/MuLAN

Code Repositories

s-sahoo/mulan
Official
jax
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
density-estimation-on-cifar-10MULAN
NLL (bits/dim): 2.55
density-estimation-on-imagenet-32x32-1MuLAN
NLL (bits/dim): 3.67

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Diffusion Models With Learned Adaptive Noise | Papers | HyperAI