HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Shaojie Bai; J. Zico Kolter; Vladlen Koltun

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Abstract

For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine translation. Given a new sequence modeling task or dataset, which architecture should one use? We conduct a systematic evaluation of generic convolutional and recurrent architectures for sequence modeling. The models are evaluated across a broad range of standard tasks that are commonly used to benchmark recurrent networks. Our results indicate that a simple convolutional architecture outperforms canonical recurrent networks such as LSTMs across a diverse range of tasks and datasets, while demonstrating longer effective memory. We conclude that the common association between sequence modeling and recurrent networks should be reconsidered, and convolutional networks should be regarded as a natural starting point for sequence modeling tasks. To assist related work, we have made code available at http://github.com/locuslab/TCN .

Code Repositories

philipperemy/keras-tcn
tf
Mentioned in GitHub
Songweiping/TCN-TF
tf
Mentioned in GitHub
proroklab/popgym
pytorch
Mentioned in GitHub
ratschlab/HIRID-ICU-Benchmark
pytorch
Mentioned in GitHub
rvandewater/yaib
pytorch
Mentioned in GitHub
ZTianle/keras-tcn-solar
tf
Mentioned in GitHub
zll1996/TCN
tf
Mentioned in GitHub
zhong110020/keras-tcn
tf
Mentioned in GitHub
linxi159/TCN
pytorch
Mentioned in GitHub
locuslab/TCN
Official
pytorch
Mentioned in GitHub
YuanTingHsieh/TF_TCN
tf
Mentioned in GitHub
hkchengrex/TCN
pytorch
Mentioned in GitHub
jxz542189/TCN_classification
tf
Mentioned in GitHub
MChen9/TCN
tf
Mentioned in GitHub
zhong110020/Tensorflow-TCN
tf
Mentioned in GitHub
ShotDownDiane/tcn-master
tf
Mentioned in GitHub
zhong110020/TensorFlow_TCN
tf
Mentioned in GitHub
Baichenjia/Tensorflow-TCN
tf
Mentioned in GitHub
zhong110020/pytorch_TCN
pytorch
Mentioned in GitHub
mhjabreel/CharCnn_Keras
tf
Mentioned in GitHub
jakeret/tcn
tf
Mentioned in GitHub
sindhura97/STraTS
pytorch
Mentioned in GitHub
ashishpatel26/tcn-keras-Examples
pytorch
Mentioned in GitHub
WenjieDu/PyPOTS
pytorch
Mentioned in GitHub
patHutchings/TCN
pytorch
Mentioned in GitHub
Nic5472K/FriendsOOGroup_TCN
pytorch
Mentioned in GitHub
kingcong/TCN
mindspore
Mentioned in GitHub
selmiss/gp-tlstgcn
pytorch
Mentioned in GitHub
anandharaju/Basic_TCN
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
language-modelling-on-penn-treebank-characterTemporal Convolutional Network
Bit per Character (BPC): 1.31
language-modelling-on-penn-treebank-wordLSTM (Bai et al., 2018)
Test perplexity: 78.93
language-modelling-on-penn-treebank-wordGRU (Bai et al., 2018)
Test perplexity: 92.48
language-modelling-on-wikitext-103TCN
Test perplexity: 45.19
music-modeling-on-jsb-choralesTCN
NLL: 8.10
music-modeling-on-nottinghamGRU
NLL: 3.46
music-modeling-on-nottinghamLSTM
NLL: 3.29
music-modeling-on-nottinghamTCN
NLL: 3.07
music-modeling-on-nottinghamRNN
NLL: 4.05
sequential-image-classification-on-sequentialTemporal Convolutional Network
Permuted Accuracy: 97.2%
Unpermuted Accuracy: 99.0%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling | Papers | HyperAI