HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Diffusion-GAN: Training GANs with Diffusion

Zhendong Wang Huangjie Zheng Pengcheng He Weizhu Chen Mingyuan Zhou

Diffusion-GAN: Training GANs with Diffusion

Abstract

Generative adversarial networks (GANs) are challenging to train stably, and a promising remedy of injecting instance noise into the discriminator input has not been very effective in practice. In this paper, we propose Diffusion-GAN, a novel GAN framework that leverages a forward diffusion chain to generate Gaussian-mixture distributed instance noise. Diffusion-GAN consists of three components, including an adaptive diffusion process, a diffusion timestep-dependent discriminator, and a generator. Both the observed and generated data are diffused by the same adaptive diffusion process. At each diffusion timestep, there is a different noise-to-data ratio and the timestep-dependent discriminator learns to distinguish the diffused real data from the diffused generated data. The generator learns from the discriminator's feedback by backpropagating through the forward diffusion chain, whose length is adaptively adjusted to balance the noise and data levels. We theoretically show that the discriminator's timestep-dependent strategy gives consistent and helpful guidance to the generator, enabling it to match the true data distribution. We demonstrate the advantages of Diffusion-GAN over strong GAN baselines on various datasets, showing that it can produce more realistic images with higher stability and data efficiency than state-of-the-art GANs.

Code Repositories

mingyuanzhou/sid-lsg
pytorch
Mentioned in GitHub
zhendong-wang/prompt-diffusion
pytorch
Mentioned in GitHub
Zhendong-Wang/Diffusion-GAN
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-generation-on-afhq-catDiffusion InsGen
FID: 2.40
image-generation-on-afhq-dogDiffusion InsGen
FID: 4.83
image-generation-on-afhq-wildDiffusion InsGen
FID: 1.51
image-generation-on-celeba-64x64Diffusion StyleGAN2
FID: 1.69
image-generation-on-ffhq-1024-x-1024Diffusion StyleGAN2
FID: 2.83
image-generation-on-lsun-bedroom-256-x-256Diffusion ProjectedGAN
FID: 1.43
image-generation-on-lsun-bedroom-256-x-256Diffusion StyleGAN2
FID: 3.65
image-generation-on-lsun-bedroom-256-x-256Diffusion ProjectedGAN (DINOv2)
FD: 547.61
Precision: 0.79
Recall: 0.28
image-generation-on-lsun-churches-256-x-256Diffusion ProjectedGAN
FID: 1.85
image-generation-on-lsun-churches-256-x-256Diffusion StyleGAN2
FID: 3.17
image-generation-on-stl-10Diffusion ProjectedGAN
FID: 6.91
image-generation-on-stl-10Diffusion StyleGAN2
FID: 11.53

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Diffusion-GAN: Training GANs with Diffusion | Papers | HyperAI