Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning

Cheng Tan Zhangyang Gao Lirong Wu Yongjie Xu Jun Xia Siyuan Li Stan Z. Li

Abstract

Spatiotemporal predictive learning aims to generate future frames by learning from historical frames. In this paper, we investigate existing methods and present a general framework of spatiotemporal predictive learning, in which the spatial encoder and decoder capture intra-frame features and the middle temporal module catches inter-frame correlations. While the mainstream methods employ recurrent units to capture long-term temporal dependencies, they suffer from low computational efficiency due to their unparallelizable architectures. To parallelize the temporal module, we propose the Temporal Attention Unit (TAU), which decomposes the temporal attention into intra-frame statical attention and inter-frame dynamical attention. Moreover, while the mean squared error loss focuses on intra-frame errors, we introduce a novel differential divergence regularization to take inter-frame variations into account. Extensive experiments demonstrate that the proposed method enables the derived model to achieve competitive performance on various spatiotemporal prediction benchmarks.
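
The abstract does not give the formula for the differential divergence regularization, so the following is only a minimal sketch of one plausible reading: take the differences between consecutive frames for both prediction and ground truth, turn each set of differences into a probability distribution with a softmax, and penalize the KL divergence between the two. The function names, the temperature `tau`, the weight `alpha`, and the flat-list frame layout are all assumptions made for illustration, not the authors' implementation.

```python
import math

def softmax(values, tau=1.0):
    """Numerically stable softmax with temperature tau over a flat list."""
    scaled = [v / tau for v in values]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def frame_differences(frames):
    """Flattened differences between consecutive frames.

    frames is a list of T frames, each a flat list of pixel values.
    """
    diffs = []
    for prev, nxt in zip(frames, frames[1:]):
        diffs.extend(b - a for a, b in zip(prev, nxt))
    return diffs

def differential_divergence(pred, target, tau=1.0, eps=1e-12):
    """Assumed form: KL(softmax(diff(pred)) || softmax(diff(target)))."""
    p = softmax(frame_differences(pred), tau)
    q = softmax(frame_differences(target), tau)
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def mse(pred, target):
    """Mean squared error over all pixels of all frames (intra-frame term)."""
    count = sum(len(frame) for frame in pred)
    squared = sum((a - b) ** 2
                  for fp, ft in zip(pred, target)
                  for a, b in zip(fp, ft))
    return squared / count

def total_loss(pred, target, alpha=0.1, tau=1.0):
    """MSE plus the weighted inter-frame term; alpha is a hypothetical weight."""
    return mse(pred, target) + alpha * differential_divergence(pred, target, tau)

# Toy sequence: one pixel brightens over three frames, the other stays dark.
target = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
static = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]    # prediction with no motion
shifted = [[v + 5.0 for v in f] for f in target]  # right motion, wrong brightness

print(mse(static, target), differential_divergence(static, target))
print(mse(shifted, target), differential_divergence(shifted, target))
```

In this toy case the uniformly shifted prediction has a large MSE but zero divergence, because its frame-to-frame changes match the ground truth, while the motionless prediction is penalized by both terms. This illustrates the split the abstract describes between intra-frame errors and inter-frame variations.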

Code Repositories

chengtan9907/simvpv2 (PyTorch; mentioned in GitHub)
chengtan9907/OpenSTL (official; PyTorch; mentioned in GitHub)

Benchmarks

Benchmark: video-prediction-on-moving-mnist
Methodology: TAU
Metrics: MAE 60.3, MSE 19.8, SSIM 0.957
