HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Long Text Generation via Adversarial Training with Leaked Information

Jiaxian Guo; Sidi Lu; Han Cai; Weinan Zhang; Yong Yu; Jun Wang

Long Text Generation via Adversarial Training with Leaked Information

Abstract

Automatically generating coherent and semantically meaningful text has many applications in machine translation, dialogue systems, image captioning, etc. Recently, by combining with policy gradient, Generative Adversarial Nets (GAN) that use a discriminative model to guide the training of the generative model as a reinforcement learning policy has shown promising results in text generation. However, the scalar guiding signal is only available after the entire text has been generated and lacks intermediate information about text structure during the generative process. As such, it limits its success when the length of the generated text samples is long (more than 20 words). In this paper, we propose a new framework, called LeakGAN, to address the problem for long text generation. We allow the discriminative net to leak its own high-level extracted features to the generative net to further help the guidance. The generator incorporates such informative signals into all generation steps through an additional Manager module, which takes the extracted features of current generated words and outputs a latent vector to guide the Worker module for next-word generation. Our extensive experiments on synthetic data and various real-world tasks with Turing test demonstrate that LeakGAN is highly effective in long text generation and also improves the performance in short text generation scenarios. More importantly, without any supervision, LeakGAN would be able to implicitly learn sentence structures only through the interaction between Manager and Worker.

Code Repositories

rupes438/CodeGen
Mentioned in GitHub
nurpeiis/LeakGAN-PyTorch
pytorch
Mentioned in GitHub
valko073/LyricsGANs
tf
Mentioned in GitHub
liyzcj/leakgan-py3
tf
Mentioned in GitHub
CR-Gjx/LeakGAN
Official
tf
Mentioned in GitHub
universebh/text_generation_fsa_gan
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
text-generation-on-chinese-poemsLeakGAN
BLEU-2: 0.456
text-generation-on-coco-captionsLeakGAN
BLEU-2: 0.950
BLEU-3: 0.880
BLEU-4: 0.778
BLEU-5: 0.686
text-generation-on-coco-captionspartGAN
BLEU-2: 0.910
BLEU-3: 0.713
BLEU-4: O.753
BLEU-5: 0.590
text-generation-on-emnlp2017-wmtLeakGAN
BLEU-2: 0.956
BLEU-3: 0.819
BLEU-4: 0.627
BLEU-5: 0.498

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp