HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever; Oriol Vinyals; Quoc V. Le

Sequence to Sequence Learning with Neural Networks

Abstract

Deep Neural Networks (DNNs) are powerful models that have achieved excellent performance on difficult learning tasks. Although DNNs work well whenever large labeled training sets are available, they cannot be used to map sequences to sequences. In this paper, we present a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure. Our method uses a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector. Our main result is that on an English to French translation task from the WMT'14 dataset, the translations produced by the LSTM achieve a BLEU score of 34.8 on the entire test set, where the LSTM's BLEU score was penalized on out-of-vocabulary words. Additionally, the LSTM did not have difficulty on long sentences. For comparison, a phrase-based SMT system achieves a BLEU score of 33.3 on the same dataset. When we used the LSTM to rerank the 1000 hypotheses produced by the aforementioned SMT system, its BLEU score increases to 36.5, which is close to the previous best result on this task. The LSTM also learned sensible phrase and sentence representations that are sensitive to word order and are relatively invariant to the active and the passive voice. Finally, we found that reversing the order of the words in all source sentences (but not target sentences) improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.

Code Repositories

Mind23-2/MindCode-73
mindspore
Mentioned in GitHub
TEAMLAB-Lecture/deep_nlp_101
tf
Mentioned in GitHub
2023-MindSpore-1/ms-code-62
mindspore
Mentioned in GitHub
umeiko/mindspore-seq2seq
mindspore
Mentioned in GitHub
moon23k/LSTM_Anchors
pytorch
Mentioned in GitHub
hnt4499/seq2seq
pytorch
Mentioned in GitHub
2023-MindSpore-1/ms-code-200
mindspore
Mentioned in GitHub
farizrahman4u/seq2seq
Mentioned in GitHub
matken11235/keras-seq2seq
tf
Mentioned in GitHub
karlie38/torchstudy
pytorch
Mentioned in GitHub
cosmoquester/seq2seq
tf
Mentioned in GitHub
sayhitosandy/Chatbot
tf
Mentioned in GitHub
2023-MindSpore-1/ms-code-197
mindspore
Mentioned in GitHub
dksifoua/Neural-Machine-Translation
pytorch
Mentioned in GitHub
starry91/NMT-Lab
Mentioned in GitHub
yash-nishaant/Seq2Seq-Chatbot
tf
Mentioned in GitHub
dongdong199408/teachchatrobot
tf
Mentioned in GitHub
trqminh/seq2seq
pytorch
Mentioned in GitHub
ArushiSinghal/NMT-assignment1
pytorch
Mentioned in GitHub
kingcong/gpu_gru
mindspore
Mentioned in GitHub
jklafka/noisy-nets
pytorch
Mentioned in GitHub
yangyucheng000/ascend_gru
mindspore
Mentioned in GitHub
mits58/Pointer-Networks
Mentioned in GitHub
2023-MindSpore-1/ms-code-61
mindspore
Mentioned in GitHub
vGkatsis/Chat_Bot_DL
pytorch
Mentioned in GitHub
AMNAALMGLY/NLP
Mentioned in GitHub
Guillem96/pointer-nn-pytorch
pytorch
Mentioned in GitHub
giovanniguidi/Seq-2-Seq-OCR
Mentioned in GitHub
2023-MindSpore-1/ms-code-198
mindspore
Mentioned in GitHub
kirillermolov/ChatBot
tf
Mentioned in GitHub
a1085728420/seq2seq-mindspore
mindspore
Mentioned in GitHub
nouhadziri/THRED
tf
Mentioned in GitHub
bentrevett/pytorch-seq2seq
pytorch
Mentioned in GitHub
dltcls/g2p_pytorch_wiktionary
pytorch
Mentioned in GitHub
lakshmichaitanyach/project_2
tf
Mentioned in GitHub
yscope75/CS2225.CH2001020
pytorch
Mentioned in GitHub
CGCL-codes/naturalcc
pytorch
Mentioned in GitHub
mlennox/summarisers
tf
Mentioned in GitHub
moon23k/RNN_Seq2Seq
pytorch
Mentioned in GitHub
isi-nlp/Zoph_RNN
Mentioned in GitHub
weiylu/NLP
tf
Mentioned in GitHub
macournoyer/neuralconvo
pytorch
Mentioned in GitHub
VishalFun/Keras_practice
Mentioned in GitHub
Chatha-Sphere/pno-ai
pytorch
Mentioned in GitHub
francescodisalvo05/66DaysOfData
pytorch
Mentioned in GitHub
minsoo9506/NLP-study
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
machine-translation-on-wmt2014-english-frenchLSTM
BLEU score: 34.8
machine-translation-on-wmt2014-english-frenchSMT+LSTM5
BLEU score: 36.5
time-series-forecasting-on-pemsd7FC-LSTM
9 steps MAE: 4.16
traffic-prediction-on-pems-mFC-LSTM
MAE (60 min): 4.16

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Sequence to Sequence Learning with Neural Networks | Papers | HyperAI