Effective Approaches to Attention-based Neural Machine Translation

Minh-Thang Luong; Hieu Pham; Christopher D. Manning

Abstract

An attentional mechanism has lately been used to improve neural machine translation (NMT) by selectively focusing on parts of the source sentence during translation. However, there has been little work exploring useful architectures for attention-based NMT. This paper examines two simple and effective classes of attentional mechanism: a global approach which always attends to all source words and a local one that only looks at a subset of source words at a time. We demonstrate the effectiveness of both approaches over the WMT translation tasks between English and German in both directions. With local attention, we achieve a significant gain of 5.0 BLEU points over non-attentional systems which already incorporate known techniques such as dropout. Our ensemble model using different attention architectures has established a new state-of-the-art result in the WMT'15 English to German translation task with 25.9 BLEU points, an improvement of 1.0 BLEU points over the existing best system backed by NMT and an n-gram reranker.
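
The difference between the two classes described in the abstract can be illustrated with a small sketch: the global variant scores every source position, while the local variant scores only a window of positions. This is a minimal illustration rather than the authors' implementation. The dot-product score, the hard (unweighted) window, and the names `attend`, `center`, and `half_width` are assumptions made for the example; the abstract does not specify them.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors (assumed scoring function)."""
    return sum(a * b for a, b in zip(u, v))


def attend(target_state, source_states, positions):
    """Score the source states at `positions`; return (weights, context)."""
    scores = [dot(target_state, source_states[i]) for i in positions]
    weights = softmax(scores)
    dim = len(target_state)
    # Context vector: weighted average of the attended source states.
    context = [
        sum(w * source_states[i][d] for w, i in zip(weights, positions))
        for d in range(dim)
    ]
    return weights, context


def global_attention(target_state, source_states):
    """Global variant: every source position is attended to."""
    return attend(target_state, source_states, range(len(source_states)))


def local_attention(target_state, source_states, center, half_width):
    """Local variant: only a window around `center` is attended to.

    `center` and `half_width` are hypothetical parameters for this sketch;
    the window is clamped to the sentence boundaries.
    """
    lo = max(0, center - half_width)
    hi = min(len(source_states) - 1, center + half_width)
    return attend(target_state, source_states, range(lo, hi + 1))


# Toy source sentence of five 2-dimensional hidden states.
source = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5], [0.0, 0.0]]
query = [1.0, 0.0]

global_weights, global_context = global_attention(query, source)
local_weights, local_context = local_attention(query, source, center=2, half_width=1)
print(len(global_weights), len(local_weights))  # prints "5 3"
```

In this sketch the cost of the global variant grows with the source length, whereas the local variant touches at most `2 * half_width + 1` positions per target step, which is the practical motivation for attending to a subset.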

Code Repositories

bplank/teaching-dl4nlp
amitz25/PCCoder (PyTorch)
mike-a-yen/date-translation (PyTorch)
jmyrberg/finnlem (TensorFlow)
sooftware/attentions (PyTorch)
neqkir/attention-mechanism (TensorFlow)
hnt4499/seq2seq (PyTorch)
huulinhcvp/chatBot (PyTorch)
saisandeep97/Chat-botV2 (PyTorch)
sblayush/Tensorflow-Attention (TensorFlow)
Shubham-SK/kronos (PyTorch)
vikua/keras-attention-models
eaplatanios/symphony-mt (TensorFlow)
pbcquoc/transformer
biyoml/End-to-End-Mandarin-ASR (PyTorch)
chao-ji/tf-seq2seq (TensorFlow)
vGkatsis/Chat_Bot_DL (PyTorch)
sen-pai/audio-word2vec-pytorch (PyTorch)
T9-LIN/MSc-Project (PyTorch)
dalek7/attention (TensorFlow)
shlokmehrotra/Convocare (PyTorch)
jackjhliu/End-to-End-Mandarin-ASR (PyTorch)
say2sarwar/DeepAcid
khanhptnk/bandit-nmt (PyTorch)
A-Jacobson/minimal-nmt (PyTorch)
sblayush/summarization (TensorFlow)
KevinYoung98/Seq2Seq-PyTorch (PyTorch)
pcyin/pytorch_basic_nmt (PyTorch)
ldulcic/chatbot (PyTorch)
Shubham-SK/TreeOverAte (PyTorch)
CongBao/ChatBot
riversdie/chatbot (PyTorch)
biyoml/Pytorch-End-to-End-ASR-on-TIMIT (PyTorch)
b-etienne/Seq2seq-PyTorch (PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
image-guided-story-ending-generation-on-lsmdc | Seq2Seq | BLEU-1: 14.21; BLEU-2: 4.56; BLEU-3: 1.70; BLEU-4: 0.70; CIDEr: 8.69; METEOR: 11.01; ROUGE-L: 19.69
image-guided-story-ending-generation-on-vist | Seq2Seq | BLEU-1: 13.96; BLEU-2: 5.57; BLEU-3: 2.94; BLEU-4: 1.69; CIDEr: 12.04; METEOR: 4.54; ROUGE-L: 16.84
machine-translation-on-20news | 12 | Accuracy: 1.0
machine-translation-on-wmt2014-english-german | RNN Enc-Dec Att | BLEU score: 20.9
machine-translation-on-wmt2014-english-german | RNN Enc-Dec | BLEU score: 11.3
machine-translation-on-wmt2014-english-german | Reverse RNN Enc-Dec | BLEU score: 14.0
