HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Unsupervised Neural Machine Translation Initialized by Unsupervised Statistical Machine Translation

Benjamin Marie; Atsushi Fujita

Unsupervised Neural Machine Translation Initialized by Unsupervised Statistical Machine Translation

Abstract

Recent work achieved remarkable results in training neural machine translation (NMT) systems in a fully unsupervised way, with new and dedicated architectures that rely on monolingual corpora only. In this work, we propose to define unsupervised NMT (UNMT) as NMT trained with the supervision of synthetic bilingual data. Our approach straightforwardly enables the use of state-of-the-art architectures proposed for supervised NMT by replacing human-made bilingual data with synthetic bilingual data for training. We propose to initialize the training of UNMT with synthetic bilingual data generated by unsupervised statistical machine translation (USMT). The UNMT system is then incrementally improved using back-translation. Our preliminary experiments show that our approach achieves a new state-of-the-art for unsupervised machine translation on the WMT16 German--English news translation task, for both translation directions.

Benchmarks

BenchmarkMethodologyMetrics
unsupervised-machine-translation-on-wmt2016Synthetic bilingual data init
BLEU: 20.0
unsupervised-machine-translation-on-wmt2016-1Synthetic bilingual data init
BLEU: 26.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Unsupervised Neural Machine Translation Initialized by Unsupervised Statistical Machine Translation | Papers | HyperAI