Convolutional Sequence to Sequence Learning

Jonas Gehring; Michael Auli; David Grangier; Denis Yarats; Yann N. Dauphin

Abstract

The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to recurrent models, computations over all elements can be fully parallelized during training and optimization is easier since the number of non-linearities is fixed and independent of the input length. Our use of gated linear units eases gradient propagation and we equip each decoder layer with a separate attention module. We outperform the accuracy of the deep LSTM setup of Wu et al. (2016) on both WMT'14 English-German and WMT'14 English-French translation at an order of magnitude faster speed, both on GPU and CPU.
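To make the abstract's two central ideas concrete, here is a minimal sketch of one convolutional block with a gated linear unit (GLU), written in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the names `glu` and `conv_glu_layer`, the weight layout, and the symmetric zero padding are all hypothetical choices made for this example. The gating itself follows the usual GLU definition, where a pre-activation of size 2d is split into halves A and B and the output is A multiplied elementwise by sigmoid(B).

```python
import math
from typing import List

Vector = List[float]


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def glu(pre_activation: Vector) -> Vector:
    """Gated linear unit: split into halves A and B, return A * sigmoid(B)."""
    half = len(pre_activation) // 2
    a, b = pre_activation[:half], pre_activation[half:]
    return [a_i * sigmoid(b_i) for a_i, b_i in zip(a, b)]


def conv_glu_layer(seq: List[Vector], kernel: List[List[Vector]], bias: Vector) -> List[Vector]:
    """One convolutional block over a sequence (hypothetical layout).

    seq:    n input vectors of size d
    kernel: k matrices, one per window offset, each with 2*d rows of size d
    bias:   vector of size 2*d
    Returns n vectors of size d; zero padding keeps the length unchanged.
    """
    k = len(kernel)
    d = len(seq[0])
    pad = k // 2  # assumption: symmetric padding, as an encoder might use
    zero = [0.0] * d
    padded = [zero] * pad + seq + [zero] * pad
    out = []
    # Each position depends only on a fixed window of the input, so the
    # iterations of this loop are independent and could run in parallel.
    for t in range(len(seq)):
        pre = list(bias)
        for offset in range(k):
            x = padded[t + offset]
            w = kernel[offset]
            for row in range(2 * d):
                pre[row] += sum(w_ij * x_j for w_ij, x_j in zip(w[row], x))
        out.append(glu(pre))  # one non-linearity per layer, whatever n is
    return out


# Usage: three positions, d = 2, kernel width k = 3, all-zero weights.
d, k = 2, 3
seq = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
kernel = [[[0.0] * d for _ in range(2 * d)] for _ in range(k)]
bias = [1.0, 1.0, 0.0, 0.0]
hidden = conv_glu_layer(seq, kernel, bias)
```

In this sketch the path from any input to the output passes through one gate per stacked layer, regardless of sequence length, which is the property the abstract contrasts with recurrent models. A decoder would additionally need padding that hides future positions; that detail is left out here.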

Code Repositories

Repository | Framework
umeiko/mindspore-seq2seq | mindspore
butsugiri/shape | pytorch
IBM/pytorch-seq2seq | pytorch
MiuGod0126/ConvS2S_Paddle | paddle
Izecson/sockeye-1.16.6 | tf
lucylow/En_francais_si_vous_plait- | pytorch
richinkabra/CoVe-BCN | pytorch
YizhuLiu/sumlen | pytorch
awslabs/sockeye | mxnet
xingniu/sockeye | mxnet
EdinburghNLP/XSum | pytorch
bcmi220/esc4nmt | pytorch
facebookresearch/ParlAI | pytorch
Helsinki-NLP/OpenNMT-py | pytorch
midobal/OpenNMT-py | pytorch
albarji/neurowriter | tf
Nick-Zhao-Engr/Machine-Translation | pytorch
facebookresearch/fairseq | pytorch (official)
shashiongithub/XSum | pytorch
Deeksha96/Im2Latex | tf
siyuofzhou/CNNSeqToSeq | pytorch
memray/OpenNMT-kpg-release | pytorch
ZurichNLP/sockeye | mxnet
Izecson/saml-nmt | mxnet
CKPOON0619/Kaggle2Sigma | (framework not listed)
phanideepgampa/IM2LATEX | tf
nrc-cnrc/sockeye-multisource | mxnet
OpenNMT/OpenNMT-py | pytorch
yinghao1019/NLP_and_DL_practice | pytorch
oneTimePad/conv-nmt | tf
cove-adml/adml-anon | pytorch
Kakoedlinnoeslovo/fairseq | tf
thinkwee/DPP_CNN_Summarization | pytorch

Benchmarks

Benchmark | Methodology | Metrics
bangla-spelling-error-correction-on-dpcspell | ConvSeq2Seq | Exact Match Accuracy: 78.85%
image-classification-on-mnist | CNN Model by Som | Accuracy: 98.59; Percentage error: 1.41
machine-translation-on-iwslt2015-english | ConvS2S | BLEU score: 26.73
machine-translation-on-iwslt2015-german | ConvS2S | BLEU score: 32.31
machine-translation-on-wmt2014-english-french | ConvS2S (ensemble) | BLEU score: 41.3
machine-translation-on-wmt2014-english-french | ConvS2S | BLEU score: 40.46; Hardware Burden: 143G
machine-translation-on-wmt2014-english-german | ConvS2S | BLEU score: 25.16; Hardware Burden: 72G
machine-translation-on-wmt2014-english-german | ConvS2S (ensemble) | BLEU score: 26.4; Hardware Burden: 54G
machine-translation-on-wmt2016-english-1 | ConvS2S BPE40k | BLEU score: 29.9
