5 months ago

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau; Kyunghyun Cho; Yoshua Bengio

Abstract

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to maximize the translation performance. The models proposed recently for neural machine translation often belong to a family of encoder-decoders and consists of an encoder that encodes a source sentence into a fixed-length vector from which a decoder generates a translation. In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly. With this new approach, we achieve a translation performance comparable to the existing state-of-the-art phrase-based system on the task of English-to-French translation. Furthermore, qualitative analysis reveals that the (soft-)alignments found by the model agree well with our intuition.

Code Repositories

yurayli/stanford-cs224n-sol

pytorch

Mentioned in GitHub

hiun/learning-transformers

pytorch

Mentioned in GitHub

mike-a-yen/date-translation

pytorch

Mentioned in GitHub

simonjisu/NMT

pytorch

Mentioned in GitHub

sooftware/attentions

pytorch

Mentioned in GitHub

jmyrberg/finnlem

Mentioned in GitHub

abhishekshakya/seq-2-seq-for-neural-machine-translation-english-to-hindi-

Mentioned in GitHub

prakhargurawa/Neural-Machine-Translation-Keras-Attention

Mentioned in GitHub

EuphoriaYan/ChatRobot-For-Keras2

Mentioned in GitHub

neqkir/attention-mechanism

Mentioned in GitHub

umeiko/mindspore-seq2seq

mindspore

Mentioned in GitHub

daphne12345/SummarizationRadiologyReports

Mentioned in GitHub

atpaino/deep-text-corrector

Mentioned in GitHub

brightmart/text_classification

Mentioned in GitHub

labdac/charlacompling

Mentioned in GitHub

prakhargurawa/Neural-Machine-Translation-Keras-German-English

Mentioned in GitHub

hnt4499/seq2seq

pytorch

Mentioned in GitHub

SimonDele/Glossary

Mentioned in GitHub

2023-MindSpore-1/ms-code-200

mindspore

Cenrax/Image-Captioning-with-Translation-

Mentioned in GitHub

harshithbelagur/Neural-Machine-Translation

Mentioned in GitHub

varun-bhaseen/Image-caption-generation-using-attention-model

Mentioned in GitHub

astorfi/sequence-to-sequence-from-scratch

pytorch

Mentioned in GitHub

schwartznir/AbstrEncap

Mentioned in GitHub

philipperemy/keras-attention-mechanism

Mentioned in GitHub

deepsense-ai/unblackboxing_webinar

Mentioned in GitHub

huulinhcvp/chatBot

pytorch

Mentioned in GitHub

mindspore-courses/DeepNLP-models-MindSpore

mindspore

Mentioned in GitHub

tree-park/kor-to-eng-translation

pytorch

Mentioned in GitHub

farizrahman4u/seq2seq

Mentioned in GitHub

cosmoquester/seq2seq

Mentioned in GitHub

lucylow/En_francais_si_vous_plait-

pytorch

Mentioned in GitHub

Izecson/sockeye-1.16.6

Mentioned in GitHub

Matthewdowney18/Yelp_seq2seq

pytorch

Mentioned in GitHub

leob03/Image_captionning

pytorch

Mentioned in GitHub

gongrennengzhi/nmt

Mentioned in GitHub

JRC1995/Abstractive-Summarization

Mentioned in GitHub

Shubham-SK/kronos

pytorch

Mentioned in GitHub

smisthzhu/attentionocr

Mentioned in GitHub

DCYN/Ramdomized-Clinical-Trail-Classification

Mentioned in GitHub

awslabs/sockeye

mxnet

Mentioned in GitHub

moon23k/Attention_Anchors

pytorch

Mentioned in GitHub

abhaskumarsinha/Seq2Seq-Bahdanau-Attention-based-Encoder-Decoder-Language-Translator

YvesWang/Machine_Translation_NLP

pytorch

Mentioned in GitHub

dongdong199408/teachchatrobot

Mentioned in GitHub

mp2893/gram

Mentioned in GitHub

vikua/keras-attention-models

Mentioned in GitHub

xingniu/sockeye

mxnet

Mentioned in GitHub

Maab-Nimir/Neural-Machine-Translation-by-Jointly-Learning-to-Align-and-Translate

pytorch

Mentioned in GitHub

sambit9238/deep_text_corrector

Mentioned in GitHub

wangmz15/Chinese-Error-Correction-with-THUMT

Mentioned in GitHub

sh951011/Attention-Implementation

pytorch

Mentioned in GitHub

gongshuangshuang/deep-text-corrector

Mentioned in GitHub

Nick-Zhao-Engr/Machine-Translation

pytorch

Mentioned in GitHub

aaaceo890/Attention

pytorch

Mentioned in GitHub

chao-ji/tf-seq2seq

Mentioned in GitHub

thunlp/TensorFlow-Summarization

Mentioned in GitHub

insigh/THUMT

Mentioned in GitHub

HemaDevaSagar35/NeuralMachineTranslation-French2English

Mentioned in GitHub

SwordYork/DCNMT

Mentioned in GitHub

vGkatsis/Chat_Bot_DL

pytorch

Mentioned in GitHub

thomlake/pytorch-attention

pytorch

Mentioned in GitHub

rileynwong/pytorch-seq2seq-joke2punchline

pytorch

Mentioned in GitHub

abhishekr7/Summarization-of-Radiological-Reports

Mentioned in GitHub

ykrmm/ICLR_2020

pytorch

Mentioned in GitHub

graykode/nlp-tutorial

pytorch

Mentioned in GitHub

AMNAALMGLY/NLP

Mentioned in GitHub

sunnysinghnitb/text_corrector_software

Mentioned in GitHub

suryachintu/Quora-Insincere-Questions-Kaggle

Mentioned in GitHub

Guillem96/pointer-nn-pytorch

pytorch

Mentioned in GitHub

sen-pai/audio-word2vec-pytorch

pytorch

Mentioned in GitHub

slme1109/Lyrics_Generator_Using_LSTM

Mentioned in GitHub

MindSpore-paper-code-3/code9/tree/main/textcnn

mindspore

brainsqueeze/text2vec

Mentioned in GitHub

qq345736500/sarcasm

Mentioned in GitHub

ScientiaEtVeritas/NeuralMachineTranslation

pytorch

Mentioned in GitHub

bentrevett/pytorch-seq2seq

pytorch

Mentioned in GitHub

shlokmehrotra/Convocare

pytorch

Mentioned in GitHub

ZurichNLP/sockeye

mxnet

Mentioned in GitHub

mayurnewase/Translation

Mentioned in GitHub

laserene/English-German-Translation-System

Mentioned in GitHub

frozentoad9/Neural-Machine-Translation

pytorch

Mentioned in GitHub

theamrzaki/text_summurization_abstractive_methods

mpavlovic/insincere-questions-classifier

Mentioned in GitHub

sooftware/nlp-attentions

pytorch

Mentioned in GitHub

A-Jacobson/minimal-nmt

pytorch

Mentioned in GitHub

jiangnanhugo/seq2seq_cuda

Mentioned in GitHub

chunghyunhee/twitter_disaster_NLP

Mentioned in GitHub

Izecson/saml-nmt

mxnet

Mentioned in GitHub

eske/seq2seq

Mentioned in GitHub

datalogue/keras-attention

Mentioned in GitHub

sooftware/Attention-Implementation

pytorch

Mentioned in GitHub

thunlp-mt/ckd

pytorch

Mentioned in GitHub

ykrmm/TREMBA

pytorch

Mentioned in GitHub

slme1109/lyrics-generator

Mentioned in GitHub

la-serene/English-German-Translation-System

Mentioned in GitHub

distractor-generation/dg_survey

Mentioned in GitHub

xhlulu/arxiv-assistant

Mentioned in GitHub

sunnysinghnitb/text-corrector-software

Mentioned in GitHub

nrc-cnrc/sockeye-multisource

mxnet

Mentioned in GitHub

zhang0jhon/AttentionOCR

Mentioned in GitHub

dalmia/Quora-Question-Pairs

Mentioned in GitHub

AaronCCWong/Show-Attend-and-Tell

pytorch

Mentioned in GitHub

THUNLP-MT/THUMT

Mentioned in GitHub

ldulcic/chatbot

pytorch

Mentioned in GitHub

umutguneri/Question-Answering-Assistant

Mentioned in GitHub

Shubham-SK/TreeOverAte

pytorch

Mentioned in GitHub

Matthewdowney18/Yelp_attention

pytorch

Mentioned in GitHub

Glaceon31/Document-Transformer

Mentioned in GitHub

uzi0espil/research-papers-implementation/tree/master/Neural%20Machine%20Translation%20by%20Jointly%20Learning%20to%20Align%20and%20Translate

abmitra84/Machine_Translation

Mentioned in GitHub

yinghao1019/NLP_and_DL_practice

pytorch

Mentioned in GitHub

TellinaTool/nl2bash

Mentioned in GitHub

Baichenjia/NMT-eager

Mentioned in GitHub

thumt/THUMT

Mentioned in GitHub

shawnyxiao/textclassification-keras

Mentioned in GitHub

bkoch4142/attention-is-all-you-need-paper

pytorch

Mentioned in GitHub

nvshrao/AlignAndTranslate

Mentioned in GitHub

lkfo415579/MT-Readling-List

Mentioned in GitHub

astorfi/neural-machine-translation-from-scratch

pytorch

Mentioned in GitHub

IS5882/Open-CyKG

Mentioned in GitHub

IpastorSan/seq2seq-with-attention-OCR-translation

Mentioned in GitHub

Epoch-Mengying/Generating-Poetry-with-Chatbot

Mentioned in GitHub

b-etienne/Seq2seq-PyTorch

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
bangla-spelling-error-correction-on-dpcspell	GRUSeq2Seq	Exact Match Accuracy: 75.56
dialogue-generation-on-persona-chat-1	Seq2Seq + Attention	Avg F1: 16.18
machine-translation-on-iwslt2015-german	Bi-GRU (MLE+SLE)	BLEU score: 28.53
machine-translation-on-wmt2014-english-french	RNN-search50*	BLEU score: 36.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau; Kyunghyun Cho; Yoshua Bengio

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters