5 months ago

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Shaojie Bai; J. Zico Kolter; Vladlen Koltun

Abstract

For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine translation. Given a new sequence modeling task or dataset, which architecture should one use? We conduct a systematic evaluation of generic convolutional and recurrent architectures for sequence modeling. The models are evaluated across a broad range of standard tasks that are commonly used to benchmark recurrent networks. Our results indicate that a simple convolutional architecture outperforms canonical recurrent networks such as LSTMs across a diverse range of tasks and datasets, while demonstrating longer effective memory. We conclude that the common association between sequence modeling and recurrent networks should be reconsidered, and convolutional networks should be regarded as a natural starting point for sequence modeling tasks. To assist related work, we have made code available at http://github.com/locuslab/TCN .

Code Repositories

XiaowanLi2018/TimeSeriesPrediction_BasedOnCNN

Mentioned in GitHub

philipperemy/keras-tcn

Mentioned in GitHub

Songweiping/TCN-TF

Mentioned in GitHub

proroklab/popgym

pytorch

Mentioned in GitHub

ratschlab/HIRID-ICU-Benchmark

pytorch

Mentioned in GitHub

rvandewater/yaib

pytorch

Mentioned in GitHub

ZTianle/keras-tcn-solar

Mentioned in GitHub

zll1996/TCN

Mentioned in GitHub

zhong110020/keras-tcn

Mentioned in GitHub

linxi159/TCN

pytorch

Mentioned in GitHub

mindspore-ai/models/tree/master/research/cv/TCN

mindspore

locuslab/TCN

Official

pytorch

Mentioned in GitHub

YuanTingHsieh/TF_TCN

Mentioned in GitHub

hkchengrex/TCN

pytorch

Mentioned in GitHub

jxz542189/TCN_classification

Mentioned in GitHub

sucheta19/Text-Classification-Using-CNN

Mentioned in GitHub

MChen9/TCN

Mentioned in GitHub

tw-yuhsi/a-new-perspective-for-shuttlecock-hitting-event-detection

pytorch

Mentioned in GitHub

zhong110020/Tensorflow-TCN

Mentioned in GitHub

ShotDownDiane/tcn-master

Mentioned in GitHub

zhong110020/TensorFlow_TCN

Mentioned in GitHub

Baichenjia/Tensorflow-TCN

Mentioned in GitHub

zhong110020/pytorch_TCN

pytorch

Mentioned in GitHub

mhjabreel/CharCnn_Keras

Mentioned in GitHub

wish44165/A-New-Perspective-for-Shuttlecock-Hitting-Event-Detection

pytorch

Mentioned in GitHub

timeseriesAI/tsai/tree/main/tsai/models

pytorch

jakeret/tcn

Mentioned in GitHub

sindhura97/STraTS

pytorch

Mentioned in GitHub

ashishpatel26/tcn-keras-Examples

pytorch

Mentioned in GitHub

WenjieDu/PyPOTS

pytorch

Mentioned in GitHub

patHutchings/TCN

pytorch

Mentioned in GitHub

Nic5472K/FriendsOOGroup_TCN

pytorch

Mentioned in GitHub

kingcong/TCN

mindspore

Mentioned in GitHub

selmiss/gp-tlstgcn

pytorch

Mentioned in GitHub

anandharaju/Basic_TCN

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
language-modelling-on-penn-treebank-character	Temporal Convolutional Network	Bit per Character (BPC): 1.31
language-modelling-on-penn-treebank-word	LSTM (Bai et al., 2018)	Test perplexity: 78.93
language-modelling-on-penn-treebank-word	GRU (Bai et al., 2018)	Test perplexity: 92.48
language-modelling-on-wikitext-103	TCN	Test perplexity: 45.19
music-modeling-on-jsb-chorales	TCN	NLL: 8.10
music-modeling-on-nottingham	GRU	NLL: 3.46
music-modeling-on-nottingham	LSTM	NLL: 3.29
music-modeling-on-nottingham	TCN	NLL: 3.07
music-modeling-on-nottingham	RNN	NLL: 4.05
sequential-image-classification-on-sequential	Temporal Convolutional Network	Permuted Accuracy: 97.2% Unpermuted Accuracy: 99.0%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Shaojie Bai; J. Zico Kolter; Vladlen Koltun

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters