3 months ago

Flexible Style Image Super-Resolution using Conditional Objective

Seung Ho Park Young Su Moon Nam Ik Cho

Abstract

Recent studies have significantly enhanced the performance of single-image super-resolution (SR) using convolutional neural networks (CNNs). While there can be many high-resolution (HR) solutions for a given input, most existing CNN-based methods do not explore alternative solutions during the inference. A typical approach to obtaining alternative SR results is to train multiple SR models with different loss weightings and exploit the combination of these models. Instead of using multiple models, we present a more efficient method to train a single adjustable SR model on various combinations of losses by taking advantage of multi-task learning. Specifically, we optimize an SR model with a conditional objective during training, where the objective is a weighted sum of multiple perceptual losses at different feature levels. The weights vary according to given conditions, and the set of weights is defined as a style controller. Also, we present an architecture appropriate for this training scheme, which is the Residual-in-Residual Dense Block equipped with spatial feature transformation layers. At the inference phase, our trained model can generate locally different outputs conditioned on the style control map. Extensive experiments show that the proposed SR model produces various desirable reconstructions without artifacts and yields comparable quantitative performance to state-of-the-art SR methods.

Code Repositories

seungho-snu/fxsr

Official

pytorch

Benchmarks

Benchmark	Methodology	Metrics
image-super-resolution-on-bsd100-4x-upscaling	FxSR-PD t=0.8	LPIPS: 0.1572 PSNR: 24.77 SSIM: 0.6817
image-super-resolution-on-bsd100-4x-upscaling	FxSR-PD t=0.0	LPIPS: 0.3433 PSNR: 26.38 SSIM: 0.738
image-super-resolution-on-bsd100-8x-upscaling	FxSR-PD t=0.0	DISTS: 0.2753 LPIPS: 0.5079 LRPSNR: 47.12 NIQE: 5.49 PSNR: 23.6 SSIM: 0.5728
image-super-resolution-on-bsd100-8x-upscaling	FxSR-PD t=0.8	DISTS: 0.1972 LPIPS: 0.3129 LRPSNR: 42.41 NIQE: 4.58 PSNR: 21.93 SSIM: 0.5039
image-super-resolution-on-div2k-val-4x	FxSR-PD t=0.8	DISTS: 0.0513 LPIPS: 0.1028 LRPSNR: 50.54 NIQE: 2.81 PSNR: 27.51 SSIM: 0.789
image-super-resolution-on-div2k-val-4x	FxSR-PD t=0.0	DISTS: 0.1169 LPIPS: 0.239 LRPSNR: 53.3 NIQE: 4.11 PSNR: 29.24 SSIM: 0.8383
image-super-resolution-on-div2k-val-8x	FxSR-PD t=0.8	DISTS: 0.119 LPIPS: 0.2403 LRPSNR: 42.66 NIQE: 3.61 PSNR: 23.56 SSIM: 0.6241
image-super-resolution-on-div2k-val-8x	FxSR-PD t=0.0	DISTS: 0.1953 LPIPS: 0.3857 LRPSNR: 46.96 NIQE: 4.41 PSNR: 25.6 SSIM: 0.6989
image-super-resolution-on-general100-4x	FxSR-PD t=0.8	DISTS: 0.0831 LPIPS: 0.0784 LRPSNR: 49.82 NIQE: 4.54 PSNR: 28.44 SSIM: 0.8229
image-super-resolution-on-general100-4x	FxSR-PD t=0.0	DISTS: 0.1205 LPIPS: 0.1519 LRPSNR: 52.22 NIQE: 6.05 PSNR: 29.94 SSIM: 0.8629
image-super-resolution-on-general100-8x	FxSR-PD t=0.8	DISTS: 0.1716 LPIPS: 0.2058 LRPSNR: 41.36 NIQE: 5.46 PSNR: 24 SSIM: 0.6534
image-super-resolution-on-general100-8x	FxSR-PD t=0.0	DISTS: 0.2134 LPIPS: 0.2924 LRPSNR: 44.28 NIQE: 6.09 PSNR: 25.42 SSIM: 0.7097

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette