HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow

Xuezhe Ma Chunting Zhou Xian Li Graham Neubig Eduard Hovy

FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow

Abstract

Most sequence-to-sequence (seq2seq) models are autoregressive; they generate each token by conditioning on previously generated tokens. In contrast, non-autoregressive seq2seq models generate all tokens in one pass, which leads to increased efficiency through parallel processing on hardware such as GPUs. However, directly modeling the joint distribution of all tokens simultaneously is challenging, and even with increasingly complex model structures accuracy lags significantly behind autoregressive models. In this paper, we propose a simple, efficient, and effective model for non-autoregressive sequence generation using latent variable models. Specifically, we turn to generative flow, an elegant technique to model complex distributions using neural networks, and design several layers of flow tailored for modeling the conditional density of sequential latent variables. We evaluate this model on three neural machine translation (NMT) benchmark datasets, achieving comparable performance with state-of-the-art non-autoregressive NMT models and almost constant decoding time w.r.t the sequence length.

Code Repositories

XuezheMax/flowseq
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
machine-translation-on-iwslt2015-germanFlowSeq-base
BLEU score: 24.75
machine-translation-on-wmt2014-english-germanFlowSeq-large (IWD n = 15)
BLEU score: 22.94
Hardware Burden:
Operations per network pass:
machine-translation-on-wmt2014-english-germanFlowSeq-base
BLEU score: 18.55
Hardware Burden:
Operations per network pass:
machine-translation-on-wmt2014-english-germanFlowSeq-large (NPD n = 15)
BLEU score: 23.14
Hardware Burden:
Operations per network pass:
machine-translation-on-wmt2014-english-germanFlowSeq-large (NPD n = 30)
BLEU score: 23.64
Hardware Burden:
Operations per network pass:
machine-translation-on-wmt2014-english-germanFlowSeq-large
BLEU score: 20.85
Hardware Burden:
Operations per network pass:
machine-translation-on-wmt2014-german-englishFlowSeq-large (NPD n = 15)
BLEU score: 27.71
machine-translation-on-wmt2014-german-englishFlowSeq-large
BLEU score: 25.4
machine-translation-on-wmt2014-german-englishFlowSeq-base
BLEU score: 23.36
machine-translation-on-wmt2014-german-englishFlowSeq-large (IWD n=15)
BLEU score: 27.16
machine-translation-on-wmt2014-german-englishFlowSeq-large (NPD n = 30)
BLEU score: 28.29
machine-translation-on-wmt2016-english-1FlowSeq-large (NPD n=15)
BLEU score: 31.97
machine-translation-on-wmt2016-english-1FlowSeq-base
BLEU score: 29.26
machine-translation-on-wmt2016-english-1FlowSeq-large
BLEU score: 29.86
machine-translation-on-wmt2016-english-1FlowSeq-large (NPD n = 30)
BLEU score: 32.35
machine-translation-on-wmt2016-english-1FlowSeq-large (IWD n = 15)
BLEU score: 31.08
machine-translation-on-wmt2016-romanianFlowSeq-large (IWD n = 15)
BLEU score: 32.03
machine-translation-on-wmt2016-romanianFlowSeq-large (NPD n = 30)
BLEU score: 32.91
machine-translation-on-wmt2016-romanianFlowSeq-large (NPD n = 15)
BLEU score: 32.46
machine-translation-on-wmt2016-romanianFlowSeq-large
BLEU score: 30.69
machine-translation-on-wmt2016-romanianFlowSeq-base
BLEU score: 30.16

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow | Papers | HyperAI