
PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning

Yunbo Wang; Zhifeng Gao; Mingsheng Long; Jianmin Wang; Philip S. Yu

Abstract

We present PredRNN++, an improved recurrent network for video predictive learning. In pursuit of a greater spatiotemporal modeling capability, our approach increases the transition depth between adjacent states using a novel recurrent unit, the Causal LSTM, which reorganizes the spatial and temporal memories in a cascaded mechanism. However, there is still a dilemma in video predictive learning: increasingly deep-in-time models have been designed to capture complex variations, but they make gradient back-propagation more difficult. To alleviate this undesirable effect, we propose a Gradient Highway architecture, which provides alternative, shorter routes for gradient flows from outputs back to long-range inputs. This architecture works seamlessly with Causal LSTMs, enabling PredRNN++ to capture short-term and long-term dependencies adaptively. We assess our model on both synthetic and real video datasets, showing its ability to ease the vanishing gradient problem and yield state-of-the-art prediction results even in a difficult object occlusion scenario.
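The idea of a "shorter route" for gradients can be illustrated with a highway-style gated update, in which a switch gate decides how much of the previous hidden state is carried forward unchanged. The sketch below is a simplification under stated assumptions, not the authors' implementation: it uses elementwise scalar weights in place of convolutions, and the function name `highway_step`, the weight names, and the gate bias `b_s` are hypothetical and do not come from this page.

```python
import math


def sigmoid(x):
    """Logistic function, maps any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def highway_step(x, z_prev, w_px=1.0, w_pz=1.0, w_sx=1.0, w_sz=1.0, b_s=0.0):
    """One highway-style update over flat lists of floats.

    x      : current input features
    z_prev : hidden state carried from the previous time step
    b_s    : hypothetical gate bias, added here only for the demonstration
    """
    z_next = []
    for xi, zi in zip(x, z_prev):
        p = math.tanh(w_px * xi + w_pz * zi)       # candidate features
        s = sigmoid(w_sx * xi + w_sz * zi + b_s)   # switch gate in (0, 1)
        z_next.append(s * p + (1.0 - s) * zi)      # blend new and carried state
    return z_next


# With the gate almost closed, the state passes through many steps nearly
# unchanged, which is the direct route that long-range gradients can follow.
z0 = [0.3, -0.2, 0.7]
z = list(z0)
for _ in range(50):
    z = highway_step([0.5, 0.5, 0.5], z, b_s=-20.0)
```

When the gate is near zero the update approaches the identity, so the derivative of the final state with respect to an early state stays close to one instead of shrinking at every step.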

Code Repositories

Repository	Framework	Notes
2023-MindSpore-1/ms-code-215/tree/main/predrnn%2B%2B	mindspore	-
Mind23-2/MindCode-5/tree/main/predrnn%2B%2B	mindspore	-
dzhv/Spatio-Temporal-mobile-traffic-forecasting	-	Mentioned in GitHub
MindSpore-paper-code-2/code2/tree/main/predrnn%2B%2B	mindspore	-
MS-Mind/MS-Code-06/tree/main/predrnn%2B%2B	mindspore	-
stevenolvil/PredRNN-V2	mindspore	Mentioned in GitHub
Yunbo426/predrnn-pp	-	Official; Mentioned in GitHub
Flunzmas/vp-suite	pytorch	-
code-implementation1/Code6/tree/main/predrnn%2B%2B	mindspore	-
thuml/predrnn-pytorch	pytorch	-
mindspore-ai/models/tree/master/official/cv/predrnn%2B%2B	mindspore	-

Benchmarks

Benchmark	Methodology	Metrics
video-prediction-on-kth	PredRNN++	Cond: 10; Pred: 20; PSNR: 28.47; SSIM: 0.865
video-prediction-on-moving-mnist	Causal LSTM	MAE: 106.8; MSE: 46.5; SSIM: 0.898
video-prediction-on-synpickvp	PredRNN++	LPIPS: 0.053; MSE: 51.73; PSNR: 27.50; SSIM: 0.894
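For readers unfamiliar with the pixel-level metrics in this table, the following is a minimal sketch of how MSE, MAE, and PSNR could be computed for a single predicted frame. It rests on assumptions not stated on this page: frames are flat lists of pixel values in [0, 1], and MSE and MAE are summed over the pixels of a frame rather than averaged. The function names are hypothetical, and SSIM and LPIPS are omitted because they need considerably more machinery.

```python
import math


def frame_mse(pred, target):
    """Sum of squared pixel errors for one frame (assumed per-frame sum)."""
    return sum((p - t) ** 2 for p, t in zip(pred, target))


def frame_mae(pred, target):
    """Sum of absolute pixel errors for one frame (assumed per-frame sum)."""
    return sum(abs(p - t) for p, t in zip(pred, target))


def frame_psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio in dB, based on the mean squared error."""
    mean_sq = frame_mse(pred, target) / len(pred)
    if mean_sq == 0:
        return float("inf")  # identical frames
    return 10.0 * math.log10(max_val ** 2 / mean_sq)


# Tiny 4-pixel example frame.
pred = [0.0, 0.5, 1.0, 1.0]
target = [0.0, 1.0, 1.0, 0.0]
scores = (frame_mse(pred, target), frame_mae(pred, target), frame_psnr(pred, target))
```

Lower MSE and MAE and higher PSNR indicate a prediction closer to the ground truth. A sequence-level score would typically average these per-frame values over all predicted frames.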
