Data Diversification: A Simple Strategy For Neural Machine Translation

Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

Abstract

We introduce Data Diversification: a simple but effective strategy to boost neural machine translation (NMT) performance. It diversifies the training data by using the predictions of multiple forward and backward models and then merging them with the original dataset; the final NMT model is trained on the merged data. Our method is applicable to all NMT models. It does not require extra monolingual data like back-translation, nor does it add more computation and parameters like ensembles of models. Our method achieves state-of-the-art BLEU scores of 30.7 and 43.7 on the WMT'14 English-German and English-French translation tasks, respectively. It also substantially improves on 8 other translation tasks: 4 IWSLT tasks (English-German and English-French) and 4 low-resource translation tasks (English-Nepali and English-Sinhala). We demonstrate that our method is more effective than knowledge distillation and dual learning, that it exhibits a strong correlation with ensembles of models, and that it trades perplexity off for a better BLEU score. We have released our source code at https://github.com/nxphi47/data_diversification
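
The merging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `diversify` function, the `Translator` type, and the toy "models" below are hypothetical names introduced here, and the sketch assumes the forward and backward models have already been trained and can be called on a single sentence.

```python
from typing import Callable, List, Tuple

Pair = Tuple[str, str]             # (source sentence, target sentence)
Translator = Callable[[str], str]  # hypothetical stand-in for a trained NMT model


def diversify(
    data: List[Pair],
    forward_models: List[Translator],
    backward_models: List[Translator],
) -> List[Pair]:
    """Merge the original pairs with synthetic pairs predicted by each model."""
    merged = list(data)
    for translate in forward_models:
        # Forward model: keep the real source, pair it with a predicted target.
        merged.extend((src, translate(src)) for src, _ in data)
    for translate in backward_models:
        # Backward model: keep the real target, pair it with a predicted source.
        merged.extend((translate(tgt), tgt) for _, tgt in data)
    return merged


# Toy stand-ins for trained models (assumption: real models would be NMT systems).
data = [("ein Haus", "a house"), ("ein Hund", "a dog")]
forward = [lambda s: f"fwd1({s})", lambda s: f"fwd2({s})"]
backward = [lambda s: f"bwd1({s})", lambda s: f"bwd2({s})"]

augmented = diversify(data, forward, backward)
```

In this sketch, the final model would be trained on `augmented` instead of `data`. No monolingual data is involved: every synthetic pair is derived from a sentence already in the parallel corpus.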

Code Repositories

nxphi47/multiagent_crosstranslate (pytorch, mentioned in GitHub)
nxphi47/data_diversification (official, pytorch, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
machine-translation-on-iwslt2014-german | Data Diversification | BLEU score: 37.2
machine-translation-on-wmt2014-english-german | Data Diversification - Transformer | BLEU score: 30.7
