HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Alias-Free Generative Adversarial Networks

Tero Karras Miika Aittala Samuli Laine Erik Härkönen Janne Hellsten Jaakko Lehtinen Timo Aila

Alias-Free Generative Adversarial Networks

Abstract

We observe that despite their hierarchical convolutional nature, the synthesis process of typical generative adversarial networks depends on absolute pixel coordinates in an unhealthy manner. This manifests itself as, e.g., detail appearing to be glued to image coordinates instead of the surfaces of depicted objects. We trace the root cause to careless signal processing that causes aliasing in the generator network. Interpreting all signals in the network as continuous, we derive generally applicable, small architectural changes that guarantee that unwanted information cannot leak into the hierarchical synthesis process. The resulting networks match the FID of StyleGAN2 but differ dramatically in their internal representations, and they are fully equivariant to translation and rotation even at subpixel scales. Our results pave the way for generative models better suited for video and animation.

Code Repositories

lzhbrian/alias-free-gan-explanation
pytorch
Mentioned in GitHub
NVlabs/stylegan3
Official
pytorch
jychoi118/toward_spatial_unbiased
pytorch
Mentioned in GitHub
duskvirkus/alias-free-gan
pytorch
Mentioned in GitHub
rosinality/alias-free-gan-pytorch
pytorch
Mentioned in GitHub
kunheek/style-aware-discriminator
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-generation-on-afhqv2Alias-Free-R
EQ-R: 40.34
EQ-T: 64.89
FID: 4.40
image-generation-on-afhqv2StyleGAN2
EQ-R: 11.50
EQ-T: 13.83
FID: 4.62
image-generation-on-afhqv2Alias-Free-T
EQ-R: 13.51
EQ-T: 60.15
FID: 4.04
image-generation-on-ffhq-1024-x-1024StyleGAN3-T
FID: 2.79
image-generation-on-ffhq-1024-x-1024StyleGAN3-R
FID: 3.07
image-generation-on-ffhq-uStyleGAN2 + Simplified generator
EQ-R: 10.41
EQ-T: 19.47
FID: 5.21
image-generation-on-ffhq-uStyleGAN2 + Non-critical sampling
EQ-R: 10.84
EQ-T: 43.90
FID: 4.78
image-generation-on-ffhq-uStyleGAN2 + No noise inputs
EQ-R: 10.84
EQ-T: 15.81
FID: 4.54
image-generation-on-ffhq-uStyleGAN2 + Rotation equiv. (Alias-Free-R)
EQ-R: 40.48
EQ-T: 66.65
FID: 4.50
image-generation-on-ffhq-uStyleGAN2 + Transformed Fourier features
EQ-R: 10.61
EQ-T: 45.20
FID: 4.64
image-generation-on-ffhq-uStyleGAN2 + Flexible layers (Alias-Free-T)
EQ-R: 13.12
EQ-T: 63.01
FID: 4.62
image-generation-on-ffhq-uAlias-Free-R
EQ-R: 47.64
EQ-T: 64.78
FID: 3.66
image-generation-on-ffhq-uStyleGAN2 + Fourier features
EQ-R: 10.81
EQ-T: 16.23
FID: 4.79
image-generation-on-ffhq-uStyleGAN2 + Boundaries & upsampling
EQ-R: 10.97
EQ-T: 24.62
FID: 6.02
image-generation-on-ffhq-uStyleGAN2
FID: 5.14
image-generation-on-ffhq-uStyleGAN2 + Filtered nonlinearities
EQ-R: 10.81
EQ-T: 30.60
FID: 6.35
image-generation-on-ffhq-uAlias-Free-T
EQ-R: 13.95
EQ-T: 61.69
FID: 3.67
image-generation-on-ffhq-uStyleGAN2 (70000 img, 1024^2, train from scratch)
EQ-R: 10.79
EQ-T: 15.89
FID: 3.79
image-generation-on-metfaces-uAlias-Free-T
EQ-R: 16.63
EQ-T: 64.11
FID: 18.75
image-generation-on-metfaces-uAlias-Free-R
EQ-R: 48.57
EQ-T: 66.34
FID: 18.75
image-generation-on-metfaces-uStyleGAN2
EQ-R: 13.19
EQ-T: 18.77
FID: 18.98

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Alias-Free Generative Adversarial Networks | Papers | HyperAI