Command Palette
Search for a command to run...
Seung Ho Park Young Su Moon Nam Ik Cho

Abstract
Recent studies have significantly enhanced the performance of single-image super-resolution (SR) using convolutional neural networks (CNNs). While there can be many high-resolution (HR) solutions for a given input, most existing CNN-based methods do not explore alternative solutions during the inference. A typical approach to obtaining alternative SR results is to train multiple SR models with different loss weightings and exploit the combination of these models. Instead of using multiple models, we present a more efficient method to train a single adjustable SR model on various combinations of losses by taking advantage of multi-task learning. Specifically, we optimize an SR model with a conditional objective during training, where the objective is a weighted sum of multiple perceptual losses at different feature levels. The weights vary according to given conditions, and the set of weights is defined as a style controller. Also, we present an architecture appropriate for this training scheme, which is the Residual-in-Residual Dense Block equipped with spatial feature transformation layers. At the inference phase, our trained model can generate locally different outputs conditioned on the style control map. Extensive experiments show that the proposed SR model produces various desirable reconstructions without artifacts and yields comparable quantitative performance to state-of-the-art SR methods.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-super-resolution-on-bsd100-4x-upscaling | FxSR-PD t=0.8 | LPIPS: 0.1572 PSNR: 24.77 SSIM: 0.6817 |
| image-super-resolution-on-bsd100-4x-upscaling | FxSR-PD t=0.0 | LPIPS: 0.3433 PSNR: 26.38 SSIM: 0.738 |
| image-super-resolution-on-bsd100-8x-upscaling | FxSR-PD t=0.0 | DISTS: 0.2753 LPIPS: 0.5079 LRPSNR: 47.12 NIQE: 5.49 PSNR: 23.6 SSIM: 0.5728 |
| image-super-resolution-on-bsd100-8x-upscaling | FxSR-PD t=0.8 | DISTS: 0.1972 LPIPS: 0.3129 LRPSNR: 42.41 NIQE: 4.58 PSNR: 21.93 SSIM: 0.5039 |
| image-super-resolution-on-div2k-val-4x | FxSR-PD t=0.8 | DISTS: 0.0513 LPIPS: 0.1028 LRPSNR: 50.54 NIQE: 2.81 PSNR: 27.51 SSIM: 0.789 |
| image-super-resolution-on-div2k-val-4x | FxSR-PD t=0.0 | DISTS: 0.1169 LPIPS: 0.239 LRPSNR: 53.3 NIQE: 4.11 PSNR: 29.24 SSIM: 0.8383 |
| image-super-resolution-on-div2k-val-8x | FxSR-PD t=0.8 | DISTS: 0.119 LPIPS: 0.2403 LRPSNR: 42.66 NIQE: 3.61 PSNR: 23.56 SSIM: 0.6241 |
| image-super-resolution-on-div2k-val-8x | FxSR-PD t=0.0 | DISTS: 0.1953 LPIPS: 0.3857 LRPSNR: 46.96 NIQE: 4.41 PSNR: 25.6 SSIM: 0.6989 |
| image-super-resolution-on-general100-4x | FxSR-PD t=0.8 | DISTS: 0.0831 LPIPS: 0.0784 LRPSNR: 49.82 NIQE: 4.54 PSNR: 28.44 SSIM: 0.8229 |
| image-super-resolution-on-general100-4x | FxSR-PD t=0.0 | DISTS: 0.1205 LPIPS: 0.1519 LRPSNR: 52.22 NIQE: 6.05 PSNR: 29.94 SSIM: 0.8629 |
| image-super-resolution-on-general100-8x | FxSR-PD t=0.8 | DISTS: 0.1716 LPIPS: 0.2058 LRPSNR: 41.36 NIQE: 5.46 PSNR: 24 SSIM: 0.6534 |
| image-super-resolution-on-general100-8x | FxSR-PD t=0.0 | DISTS: 0.2134 LPIPS: 0.2924 LRPSNR: 44.28 NIQE: 6.09 PSNR: 25.42 SSIM: 0.7097 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.