Wavelet Diffusion Models are fast and scalable Image Generators

Hao Phung, Quan Dao, Anh Tran

Abstract

Diffusion models are emerging as a powerful approach to high-fidelity image generation, exceeding GANs in quality in many circumstances. However, their slow training and inference are a major bottleneck that keeps them out of real-time applications. A recent DiffusionGAN method significantly reduces running time by cutting the number of sampling steps from thousands to a few, but its speed still lags well behind that of its GAN counterparts. This paper aims to narrow that speed gap with a novel wavelet-based diffusion scheme. We extract low- and high-frequency components at both the image and feature levels via wavelet decomposition and handle these components adaptively for faster processing while maintaining good generation quality. Furthermore, we propose a reconstruction term that effectively boosts training convergence. Experimental results on the CelebA-HQ, CIFAR-10, LSUN-Church, and STL-10 datasets show that our solution is a stepping stone toward real-time, high-fidelity diffusion models. Our code and pre-trained checkpoints are available at https://github.com/VinAIResearch/WaveDiff.git.
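To make the wavelet decomposition step concrete, the following is a minimal sketch of a single-level 2D transform that splits an image into one low-frequency and three high-frequency subbands, each at half the spatial resolution. The Haar wavelet, the subband names (LL, LH, HL, HH), and the function names haar_dwt2 and haar_idwt2 are assumptions made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
import numpy as np


def haar_dwt2(x):
    """Single-level 2D Haar transform of an (H, W) array with even H and W.

    Returns four (H/2, W/2) subbands. The names follow one common
    convention and are an assumption of this sketch.
    """
    x = np.asarray(x, dtype=np.float64)
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # local average: low-frequency content
    lh = (a - b + c - d) / 2.0  # difference between columns
    hl = (a + b - c - d) / 2.0  # difference between rows
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh


def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: rebuilds the (2h, 2w) array from its subbands."""
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w), dtype=np.float64)
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 0::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x


# Usage: decompose a random 8x8 "image" and reconstruct it.
image = np.random.default_rng(0).random((8, 8))
subbands = haar_dwt2(image)
restored = haar_idwt2(*subbands)
```

Because each subband has a quarter of the original pixels, a network operating on the stacked subbands sees a spatially smaller input, which is one plausible reading of why working in the wavelet domain can speed up processing. The transform is invertible, so no image information is lost by the decomposition itself.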

Code Repositories

vinairesearch/wavediff (official, PyTorch)

Benchmarks

| Benchmark | Methodology | FID | NFE | Recall |
| --- | --- | --- | --- | --- |
| image-generation-on-celeba-hq-1024x1024 | WaveDiff | 5.98 | 2 | - |
| image-generation-on-celeba-hq-256x256 | WaveDiff | 5.94 | 2 | 0.37 |
| image-generation-on-celeba-hq-512x512 | WaveDiff | 6.40 | 2 | 0.35 |
| image-generation-on-lsun-churches-256-x-256 | WaveDiff | 5.06 | 4 | 0.40 |
| image-generation-on-stl-10 | WaveDiff | 12.93 | 4 | 0.41 |
