Command Palette
Search for a command to run...
Temporal Modulation Network for Controllable Space-Time Video Super-Resolution
Gang Xu; Jun Xu; Zhen Li; Liang Wang; Xing Sun; Ming-Ming Cheng

Abstract
Space-time video super-resolution (STVSR) aims to increase the spatial and temporal resolutions of low-resolution and low-frame-rate videos. Recently, deformable convolution based methods have achieved promising STVSR performance, but they could only infer the intermediate frame pre-defined in the training stage. Besides, these methods undervalued the short-term motion cues among adjacent frames. In this paper, we propose a Temporal Modulation Network (TMNet) to interpolate arbitrary intermediate frame(s) with accurate high-resolution reconstruction. Specifically, we propose a Temporal Modulation Block (TMB) to modulate deformable convolution kernels for controllable feature interpolation. To well exploit the temporal information, we propose a Locally-temporal Feature Comparison (LFC) module, along with the Bi-directional Deformable ConvLSTM, to extract short-term and long-term motion cues in videos. Experiments on three benchmark datasets demonstrate that our TMNet outperforms previous STVSR methods. The code is available at https://github.com/CS-GangXu/TMNet.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| video-super-resolution-on-msu-super-1 | TMNet + x264 | BSQ-rate over ERQA: 1.879 BSQ-rate over LPIPS: 1.377 BSQ-rate over MS-SSIM: 0.844 BSQ-rate over PSNR: 1.481 BSQ-rate over VMAF: 1.061 |
| video-super-resolution-on-msu-super-1 | TMNet + uavs3e | BSQ-rate over ERQA: 13.187 BSQ-rate over LPIPS: 5.015 BSQ-rate over MS-SSIM: 4.317 BSQ-rate over PSNR: 15.144 BSQ-rate over VMAF: 3.487 |
| video-super-resolution-on-msu-super-1 | TMNet + vvenc | BSQ-rate over ERQA: 21.303 BSQ-rate over LPIPS: 13.988 BSQ-rate over MS-SSIM: 1.813 BSQ-rate over PSNR: 9.43 BSQ-rate over VMAF: 1.795 |
| video-super-resolution-on-msu-super-1 | TMNet + x265 | BSQ-rate over ERQA: 13.577 BSQ-rate over LPIPS: 13.485 BSQ-rate over MS-SSIM: 1.735 BSQ-rate over PSNR: 7.046 BSQ-rate over VMAF: 2.009 |
| video-super-resolution-on-msu-super-1 | TMNet + aomenc | BSQ-rate over ERQA: 21.798 BSQ-rate over LPIPS: 6.276 BSQ-rate over MS-SSIM: 10.322 BSQ-rate over PSNR: 15.144 BSQ-rate over VMAF: 4.667 |
| video-super-resolution-on-msu-vsr-benchmark | TMNet | 1 - LPIPS: 0.931 ERQAv1.0: 0.712 FPS: 1.136 PSNR: 30.364 QRCRv1.0: 0.549 SSIM: 0.885 Subjective score: 6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.