HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Plug-and-Play Tri-Branch Invertible Block for Image Rescaling

Bao Jingwei ; Hao Jinhua ; Xu Pengcheng ; Sun Ming ; Zhou Chao ; Zhu Shuyuan

Plug-and-Play Tri-Branch Invertible Block for Image Rescaling

Abstract

High-resolution (HR) images are commonly downscaled to low-resolution (LR) toreduce bandwidth, followed by upscaling to restore their original details.Recent advancements in image rescaling algorithms have employed invertibleneural networks (INNs) to create a unified framework for downscaling andupscaling, ensuring a one-to-one mapping between LR and HR images. Traditionalmethods, utilizing dual-branch based vanilla invertible blocks, processhigh-frequency and low-frequency information separately, often relying onspecific distributions to model high-frequency components. However, processingthe low-frequency component directly in the RGB domain introduces channelredundancy, limiting the efficiency of image reconstruction. To address thesechallenges, we propose a plug-and-play tri-branch invertible block(T-InvBlocks) that decomposes the low-frequency branch into luminance (Y) andchrominance (CbCr) components, reducing redundancy and enhancing featureprocessing. Additionally, we adopt an all-zero mapping strategy forhigh-frequency components during upscaling, focusing essential rescalinginformation within the LR image. Our T-InvBlocks can be seamlessly integratedinto existing rescaling models, improving performance in both general rescalingtasks and scenarios involving lossy compression. Extensive experiments confirmthat our method advances the state of the art in HR image reconstruction.

Code Repositories

jingwei-bao/t-invblocks
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-rescaling-on-bsd100-2xT-IRN
PSNR: 42.68
SSIM: 0.9913
image-rescaling-on-bsd100-4xT-IRN
PSNR: 31.64
SSIM: 0.8837
image-rescaling-on-div2k-val-2xT-IRN
PSNR: 45.46
SSIM: 0.9932
image-rescaling-on-div2k-val-4xT-IRN
PSNR: 35.10
SSIM: 0.9328
image-rescaling-on-div2k-val-q30-2xT-SAIN
PSNR: 31.89
SSIM: 0.8912
image-rescaling-on-div2k-val-q30-4xT-SAIN
PSNR: 28.08
SSIM: 0.7893
image-rescaling-on-div2k-val-q50-2xT-SAIN
PSNR: 33.71
SSIM: 0.9210
image-rescaling-on-div2k-val-q50-4xT-SAIN
PSNR: 29.43
SSIM: 0.8237
image-rescaling-on-div2k-val-q70-2xT-SAIN
PSNR: 35.20
SSIM: 0.9384
image-rescaling-on-div2k-val-q70-4xT-SAIN
PSNR: 30.34
SSIM: 0.8421
image-rescaling-on-div2k-val-q90-2xT-SAIN
PSNR: 36.30
SSIM: 0.9478
image-rescaling-on-div2k-val-q90-4xT-SAIN
PSNR: 30.92
SSIM: 0.8517
image-rescaling-on-set14-2xT-IRN
PSNR: 41.70
SSIM: 0.9809
image-rescaling-on-set14-4xT-IRN
PSNR: 32.70
SSIM: 0.9003
image-rescaling-on-set5-2xT-IRN
PSNR: 44.86
SSIM: 0.9883
image-rescaling-on-set5-4xT-IRN
PSNR: 36.29
SSIM: 0.9452
image-rescaling-on-urban100-2xT-IRN
PSNR: 41.05
SSIM: 0.9899
image-rescaling-on-urban100-4xT-IRN
PSNR: 31.19
SSIM: 0.9132

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp