HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Spatio-Temporal Deformable Attention Network for Video Deblurring

Huicong Zhang Haozhe Xie Hongxun Yao

Spatio-Temporal Deformable Attention Network for Video Deblurring

Abstract

The key success factor of the video deblurring methods is to compensate for the blurry pixels of the mid-frame with the sharp pixels of the adjacent video frames. Therefore, mainstream methods align the adjacent frames based on the estimated optical flows and fuse the alignment frames for restoration. However, these methods sometimes generate unsatisfactory results because they rarely consider the blur levels of pixels, which may introduce blurry pixels from video frames. Actually, not all the pixels in the video frames are sharp and beneficial for deblurring. To address this problem, we propose the spatio-temporal deformable attention network (STDANet) for video delurring, which extracts the information of sharp pixels by considering the pixel-wise blur levels of the video frames. Specifically, STDANet is an encoder-decoder network combined with the motion estimator and spatio-temporal deformable attention (STDA) module, where motion estimator predicts coarse optical flows that are used as base offsets to find the corresponding sharp pixels in STDA module. Experimental results indicate that the proposed STDANet performs favorably against state-of-the-art methods on the GoPro, DVD, and BSD datasets.

Code Repositories

huicongzhang/stdan
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
deblurring-on-dvd-1STDAN
PSNR: 33.05
SSIM: 0.9374
deblurring-on-goproSTDAN
PSNR: 32.29
SSIM: 0.9313

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Spatio-Temporal Deformable Attention Network for Video Deblurring | Papers | HyperAI