HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring

Lingshun Kong Jiangxin Dong Mingqiang Li Jianjun Ge Jinshan Pan

Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring

Abstract

We present an effective and efficient method that explores the properties of Transformers in the frequency domain for high-quality image deblurring. Our method is motivated by the convolution theorem that the correlation or convolution of two signals in the spatial domain is equivalent to an element-wise product of them in the frequency domain. This inspires us to develop an efficient frequency domain-based self-attention solver (FSAS) to estimate the scaled dot-product attention by an element-wise product operation instead of the matrix multiplication in the spatial domain. In addition, we note that simply using the naive feed-forward network (FFN) in Transformers does not generate good deblurred results. To overcome this problem, we propose a simple yet effective discriminative frequency domain-based FFN (DFFN), where we introduce a gated mechanism in the FFN based on the Joint Photographic Experts Group (JPEG) compression algorithm to discriminatively determine which low- and high-frequency information of the features should be preserved for latent clear image restoration. We formulate the proposed FSAS and DFFN into an asymmetrical network based on an encoder and decoder architecture, where the FSAS is only used in the decoder module for better image deblurring. Experimental results show that the proposed method performs favorably against the state-of-the-art approaches. Code will be available at \url{https://github.com/kkkls/FFTformer}.

Code Repositories

kkkls/fftformer
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
deblurring-on-hide-trained-on-goproFFTformer
PSNR (sRGB): 31.62
Params (M): 16.6
SSIM (sRGB): 0.9455
deblurring-on-realblur-j-1FFTformer
PSNR (sRGB): 32.62
SSIM (sRGB): 0.9326
deblurring-on-realblur-rFFTformer
PSNR (sRGB): 40.11
SSIM (sRGB): 0.9737
image-deblurring-on-goprofftformer
PSNR: 34.21
Params (M): 16.6
SSIM: 0.969

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring | Papers | HyperAI