HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Temporal Modulation Network for Controllable Space-Time Video Super-Resolution

Gang Xu; Jun Xu; Zhen Li; Liang Wang; Xing Sun; Ming-Ming Cheng

Temporal Modulation Network for Controllable Space-Time Video Super-Resolution

Abstract

Space-time video super-resolution (STVSR) aims to increase the spatial and temporal resolutions of low-resolution and low-frame-rate videos. Recently, deformable convolution based methods have achieved promising STVSR performance, but they could only infer the intermediate frame pre-defined in the training stage. Besides, these methods undervalued the short-term motion cues among adjacent frames. In this paper, we propose a Temporal Modulation Network (TMNet) to interpolate arbitrary intermediate frame(s) with accurate high-resolution reconstruction. Specifically, we propose a Temporal Modulation Block (TMB) to modulate deformable convolution kernels for controllable feature interpolation. To well exploit the temporal information, we propose a Locally-temporal Feature Comparison (LFC) module, along with the Bi-directional Deformable ConvLSTM, to extract short-term and long-term motion cues in videos. Experiments on three benchmark datasets demonstrate that our TMNet outperforms previous STVSR methods. The code is available at https://github.com/CS-GangXu/TMNet.

Code Repositories

CS-GangXu/TMNet
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
video-super-resolution-on-msu-super-1TMNet + x264
BSQ-rate over ERQA: 1.879
BSQ-rate over LPIPS: 1.377
BSQ-rate over MS-SSIM: 0.844
BSQ-rate over PSNR: 1.481
BSQ-rate over VMAF: 1.061
video-super-resolution-on-msu-super-1TMNet + uavs3e
BSQ-rate over ERQA: 13.187
BSQ-rate over LPIPS: 5.015
BSQ-rate over MS-SSIM: 4.317
BSQ-rate over PSNR: 15.144
BSQ-rate over VMAF: 3.487
video-super-resolution-on-msu-super-1TMNet + vvenc
BSQ-rate over ERQA: 21.303
BSQ-rate over LPIPS: 13.988
BSQ-rate over MS-SSIM: 1.813
BSQ-rate over PSNR: 9.43
BSQ-rate over VMAF: 1.795
video-super-resolution-on-msu-super-1TMNet + x265
BSQ-rate over ERQA: 13.577
BSQ-rate over LPIPS: 13.485
BSQ-rate over MS-SSIM: 1.735
BSQ-rate over PSNR: 7.046
BSQ-rate over VMAF: 2.009
video-super-resolution-on-msu-super-1TMNet + aomenc
BSQ-rate over ERQA: 21.798
BSQ-rate over LPIPS: 6.276
BSQ-rate over MS-SSIM: 10.322
BSQ-rate over PSNR: 15.144
BSQ-rate over VMAF: 4.667
video-super-resolution-on-msu-vsr-benchmarkTMNet
1 - LPIPS: 0.931
ERQAv1.0: 0.712
FPS: 1.136
PSNR: 30.364
QRCRv1.0: 0.549
SSIM: 0.885
Subjective score: 6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Temporal Modulation Network for Controllable Space-Time Video Super-Resolution | Papers | HyperAI