HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications

Tim Salimans; Andrej Karpathy; Xi Chen; Diederik P. Kingma

PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications

Abstract

PixelCNNs are a recently proposed class of powerful generative models with tractable likelihood. Here we discuss our implementation of PixelCNNs which we make available at https://github.com/openai/pixel-cnn. Our implementation contains a number of modifications to the original model that both simplify its structure and improve its performance. 1) We use a discretized logistic mixture likelihood on the pixels, rather than a 256-way softmax, which we find to speed up training. 2) We condition on whole pixels, rather than R/G/B sub-pixels, simplifying the model structure. 3) We use downsampling to efficiently capture structure at multiple resolutions. 4) We introduce additional short-cut connections to further speed up optimization. 5) We regularize the model using dropout. Finally, we present state-of-the-art log likelihood results on CIFAR-10 to demonstrate the usefulness of these modifications.

Code Repositories

tuatruog/astrocompress
pytorch
Mentioned in GitHub
andrecianflone/vector_quantization
pytorch
Mentioned in GitHub
kamenbliznashki/pixel_models
pytorch
Mentioned in GitHub
pclucas14/pixel-cnn-pp
pytorch
Mentioned in GitHub
ajayjain/lmconv
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
density-estimation-on-cifar-10Pixel CNN ++
NLL (bits/dim): 2.92

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications | Papers | HyperAI