Command Palette
Search for a command to run...
{Xinyi Peng Xianfang Sun Longcun Jin Hua Wang Dewei Su}
Abstract
The goal of video super-resolution technique is to address the problem of effectively restoring high-resolution (HR) videos from low-resolution (LR) ones. Previous methods commonly used optical flow to perform frame alignment and designed a framework from the perspective of space and time. However, inaccurate optical flow estimation may occur easily which leads to inferior restoration effects. In addition, how to effectively fuse the features of various video frames remains a challenging problem. In this paper, we propose a Local-Global Fusion Network (LGFN) to solve the above issues from a novel viewpoint. As an alternative to optical flow, deformable convolutions (DCs) with decreased multi-dilation convolution units (DMDCUs) are applied for efficient implicit alignment. Moreover, a structure with two branches, consisting of a Local Fusion Module (LFM) and a Global Fusion Module (GFM), is proposed to combine information from two different aspects. Specifically, LFM focuses on the relationship between adjacent frames and maintains the temporal consistency while GFM attempts to take advantage of all related features globally with a video shuffle strategy. Benefiting from our advanced network, experimental results on several datasets demonstrate that our LGFN can not only achieve comparative performance with state-of-the-art methods but also possess reliable ability on restoring a variety of video frames. The results on benchmark datasets of our LGFN are presented on https://github.com/BIOINSu/LGFN and the source code will be released as soon as the paper is accepted.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| video-super-resolution-on-msu-super-1 | LGFN + aomenc | BSQ-rate over ERQA: 14.631 BSQ-rate over LPIPS: 5.536 BSQ-rate over MS-SSIM: 4.321 BSQ-rate over PSNR: 9.79 BSQ-rate over VMAF: 1.99 |
| video-super-resolution-on-msu-super-1 | LGFN + x264 | BSQ-rate over ERQA: 1.704 BSQ-rate over LPIPS: 1.324 BSQ-rate over MS-SSIM: 0.77 BSQ-rate over PSNR: 1.151 BSQ-rate over VMAF: 0.744 |
| video-super-resolution-on-msu-super-1 | LGFN + vvenc | BSQ-rate over ERQA: 18.342 BSQ-rate over LPIPS: 11.759 BSQ-rate over MS-SSIM: 0.889 BSQ-rate over PSNR: 5.768 BSQ-rate over Subjective Score: 2.944 BSQ-rate over VMAF: 1.626 |
| video-super-resolution-on-msu-super-1 | LGFN + x265 | BSQ-rate over ERQA: 13.213 BSQ-rate over LPIPS: 11.399 BSQ-rate over MS-SSIM: 1.533 BSQ-rate over PSNR: 6.646 BSQ-rate over VMAF: 1.341 |
| video-super-resolution-on-msu-super-1 | LGFN + uavs3e | BSQ-rate over ERQA: 9.279 BSQ-rate over LPIPS: 4.504 BSQ-rate over MS-SSIM: 2.427 BSQ-rate over PSNR: 5.503 BSQ-rate over VMAF: 1.625 |
| video-super-resolution-on-msu-video-upscalers | LGFN | PSNR: 27.42 SSIM: 0.939 VMAF: 57.79 |
| video-super-resolution-on-msu-vsr-benchmark | LGFN | 1 - LPIPS: 0.903 ERQAv1.0: 0.74 FPS: 0.667 PSNR: 31.291 QRCRv1.0: 0.629 SSIM: 0.898 Subjective score: 6.505 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.