HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

NVAE: A Deep Hierarchical Variational Autoencoder

Arash Vahdat Jan Kautz

NVAE: A Deep Hierarchical Variational Autoencoder

Abstract

Normalizing flows, autoregressive models, variational autoencoders (VAEs), and deep energy-based models are among competing likelihood-based frameworks for deep generative learning. Among them, VAEs have the advantage of fast and tractable sampling and easy-to-access encoding networks. However, they are currently outperformed by other models such as normalizing flows and autoregressive models. While the majority of the research in VAEs is focused on the statistical challenges, we explore the orthogonal direction of carefully designing neural architectures for hierarchical VAEs. We propose Nouveau VAE (NVAE), a deep hierarchical VAE built for image generation using depth-wise separable convolutions and batch normalization. NVAE is equipped with a residual parameterization of Normal distributions and its training is stabilized by spectral regularization. We show that NVAE achieves state-of-the-art results among non-autoregressive likelihood-based models on the MNIST, CIFAR-10, CelebA 64, and CelebA HQ datasets and it provides a strong baseline on FFHQ. For example, on CIFAR-10, NVAE pushes the state-of-the-art from 2.98 to 2.91 bits per dimension, and it produces high-quality images on CelebA HQ. To the best of our knowledge, NVAE is the first successful VAE applied to natural images as large as 256$\times$256 pixels. The source code is available at https://github.com/NVlabs/NVAE .

Code Repositories

Aiwizo/template-nvae
pytorch
Mentioned in GitHub
lagergren-lab/miselbo
pytorch
Mentioned in GitHub
tcl9876/visual-vae
jax
Mentioned in GitHub
NVlabs/NVAE
Official
pytorch
Mentioned in GitHub
NVlabs/LSGM
pytorch
Mentioned in GitHub
oadonca/ANVAE
tf
Mentioned in GitHub
SerezD/NVAE-from-scratch
pytorch
Mentioned in GitHub
etotheipi/nvae_tensorflow
tf
Mentioned in GitHub
chethankodase/alma
pytorch
Mentioned in GitHub
NVlabs/VAEBM
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-generation-on-celeba-256x256NVAE w/ flow
bpd: 0.70
image-generation-on-cifar-10NVAE w/ flow
FID: 32.53
image-generation-on-ffhq-256-x-256NVAE w/ flow
bits/dimension: 0.69
image-generation-on-imagenet-32x32NVAE w/ flow
bpd: 3.92

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
NVAE: A Deep Hierarchical Variational Autoencoder | Papers | HyperAI