Levenshtein Transformer

Jiatao Gu; Changhan Wang; Jake Zhao

Abstract

Modern neural sequence generation models are built to either generate tokens step-by-step from scratch or (iteratively) modify a sequence of tokens bounded by a fixed length. In this work, we develop Levenshtein Transformer, a new partially autoregressive model devised for more flexible and amenable sequence generation. Unlike previous approaches, the atomic operations of our model are insertion and deletion. Their combination facilitates not only generation but also sequence refinement, allowing dynamic length changes. We also propose a set of new training techniques dedicated to them, effectively exploiting one as the other's learning signal thanks to their complementary nature. Experiments applying the proposed model achieve comparable performance but much-improved efficiency on both generation (e.g. machine translation, text summarization) and refinement tasks (e.g. automatic post-editing). We further confirm the flexibility of our model by showing that a Levenshtein Transformer trained on machine translation can straightforwardly be used for automatic post-editing.
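To make the two atomic operations concrete, the following minimal sketch decomposes the edit from one token sequence to another into a deletion step (a keep/delete mask over the source) followed by an insertion step (tokens placed into the slots around the kept tokens). It is an illustration only, not the paper's model, training procedure, or the fairseq implementation: the function names `edit_ops` and `apply_ops` are hypothetical, and the alignment comes from Python's `difflib`, which is not guaranteed to produce a minimal edit script.

```python
from difflib import SequenceMatcher


def edit_ops(source, target):
    """Split the source -> target edit into deletions and insertions.

    Returns (keep, slots):
      keep[i]  is False when source[i] should be deleted.
      slots[j] holds the tokens to insert into slot j, where slot 0 is
               before the first kept token and slot j follows the j-th
               kept token.
    Assumption: difflib's alignment is used for illustration only.
    """
    matcher = SequenceMatcher(a=source, b=target, autojunk=False)
    keep = [False] * len(source)
    slots = [[]]
    t_pos = 0
    # Matching blocks are monotonically increasing in both sequences and
    # end with a zero-length sentinel at (len(source), len(target)).
    for a_start, b_start, size in matcher.get_matching_blocks():
        # Unmatched target tokens go into the current slot.
        slots[-1].extend(target[t_pos:b_start])
        for k in range(size):
            keep[a_start + k] = True
            slots.append([])  # open a new slot after each kept token
        t_pos = b_start + size
    return keep, slots


def apply_ops(source, keep, slots):
    """Apply the deletion mask first, then fill the insertion slots."""
    kept = [tok for tok, flag in zip(source, keep) if flag]
    out = list(slots[0])
    for tok, slot in zip(kept, slots[1:]):
        out.append(tok)
        out.extend(slot)
    return out


if __name__ == "__main__":
    src = "the cat sat on mat".split()
    tgt = "a cat sat on the mat".split()
    keep, slots = edit_ops(src, tgt)
    print(keep)
    print(slots)
    print(apply_ops(src, keep, slots) == tgt)
```

Because the output length is the number of kept tokens plus the number of inserted tokens, it is not tied to the input length, which is the "dynamic length changes" property the abstract refers to. Starting from an empty source reduces the edit to pure insertion, i.e. generation from scratch.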

Code Repositories

Repository	Framework	Status
pytorch/fairseq	pytorch	Official
maxwell1447/fairseq	pytorch	Mentioned in GitHub
ictnlp/Seq-NAT	pytorch	Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
machine-translation-on-wmt2014-english-german	Levenshtein Transformer (distillation)	BLEU score: 27.27
machine-translation-on-wmt2016-romanian	Levenshtein Transformer (distillation)	BLEU score: 33.26
