HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Improved Variational Autoencoders for Text Modeling using Dilated Convolutions

Zichao Yang; Zhiting Hu; Ruslan Salakhutdinov; Taylor Berg-Kirkpatrick

Improved Variational Autoencoders for Text Modeling using Dilated Convolutions

Abstract

Recent work on generative modeling of text has found that variational auto-encoders (VAE) incorporating LSTM decoders perform worse than simpler LSTM language models (Bowman et al., 2015). This negative result is so far poorly understood, but has been attributed to the propensity of LSTM decoders to ignore conditioning information from the encoder. In this paper, we experiment with a new type of decoder for VAE: a dilated CNN. By changing the decoder's dilation architecture, we control the effective context from previously generated words. In experiments, we find that there is a trade off between the contextual capacity of the decoder and the amount of encoding information used. We show that with the right decoder, VAE can outperform LSTM language models. We demonstrate perplexity gains on two datasets, representing the first positive experimental result on the use VAE for generative modeling of text. Further, we conduct an in-depth investigation of the use of VAE (with our new decoding architecture) for semi-supervised and unsupervised labeling tasks, demonstrating gains over several strong baselines.

Code Repositories

ryokamoi/dcnn_textvae
tf
Mentioned in GitHub
harvardnlp/sa-vae
pytorch
Mentioned in GitHub
kefirski/contiguous-succotash
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
text-generation-on-yahoo-questionsCNN-VAE
KL: 10.0
NLL: 332.1
Perplexity: 63.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Improved Variational Autoencoders for Text Modeling using Dilated Convolutions | Papers | HyperAI