HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Discrete Flows: Invertible Generative Models of Discrete Data

Dustin Tran; Keyon Vafa; Kumar Krishna Agrawal; Laurent Dinh; Ben Poole

Discrete Flows: Invertible Generative Models of Discrete Data

Abstract

While normalizing flows have led to significant advances in modeling high-dimensional continuous distributions, their applicability to discrete distributions remains unknown. In this paper, we show that flows can in fact be extended to discrete events---and under a simple change-of-variables formula not requiring log-determinant-Jacobian computations. Discrete flows have numerous applications. We consider two flow architectures: discrete autoregressive flows that enable bidirectionality, allowing, for example, tokens in text to depend on both left-to-right and right-to-left contexts in an exact language model; and discrete bipartite flows that enable efficient non-autoregressive generation as in RealNVP. Empirically, we find that discrete autoregressive flows outperform autoregressive baselines on synthetic discrete distributions, an addition task, and Potts models; and bipartite flows can obtain competitive performance with autoregressive baselines on character-level language modeling for Penn Tree Bank and text8.

Code Repositories

TrentBrick/PyTorchDiscreteFlows
pytorch
Mentioned in GitHub
google/edward2
Official
tf

Benchmarks

BenchmarkMethodologyMetrics
language-modelling-on-penn-treebank-characterBipartite Flow
Bit per Character (BPC): 1.38
language-modelling-on-text8Bipartite flows (8 flows)
Bit per Character (BPC): 1.23

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Discrete Flows: Invertible Generative Models of Discrete Data | Papers | HyperAI