HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment

Kai Xu Ziwei Yu Xin Wang Michael Bi Mi Angela Yao

Enhancing Video Super-Resolution via Implicit Resampling-based Alignment

Abstract

In video super-resolution, it is common to use a frame-wise alignment to support the propagation of information over time. The role of alignment is well-studied for low-level enhancement in video, but existing works overlook a critical step -- resampling. We show through extensive experiments that for alignment to be effective, the resampling should preserve the reference frequency spectrum while minimizing spatial distortions. However, most existing works simply use a default choice of bilinear interpolation for resampling even though bilinear interpolation has a smoothing effect and hinders super-resolution. From these observations, we propose an implicit resampling-based alignment. The sampling positions are encoded by a sinusoidal positional encoding, while the value is estimated with a coordinate network and a window-based cross-attention. We show that bilinear interpolation inherently attenuates high-frequency information while an MLP-based coordinate network can approximate more frequencies. Experiments on synthetic and real-world datasets show that alignment with our proposed implicit resampling enhances the performance of state-of-the-art frameworks with minimal impact on both compute and parameters.

Code Repositories

kai422/iart
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
video-super-resolution-on-reds4-4x-upscalingIART
PSNR: 32.90
SSIM: 0.9138
video-super-resolution-on-vid4-4x-upscalingIART
PSNR: 28.26
SSIM: 0.8517

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Enhancing Video Super-Resolution via Implicit Resampling-based Alignment | Papers | HyperAI