Command Palette
Search for a command to run...
Xintao Wang; Ke Yu; Shixiang Wu; Jinjin Gu; Yihao Liu; Chao Dong; Chen Change Loy; Yu Qiao; Xiaoou Tang

Abstract
The Super-Resolution Generative Adversarial Network (SRGAN) is a seminal work that is capable of generating realistic textures during single image super-resolution. However, the hallucinated details are often accompanied with unpleasant artifacts. To further enhance the visual quality, we thoroughly study three key components of SRGAN - network architecture, adversarial loss and perceptual loss, and improve each of them to derive an Enhanced SRGAN (ESRGAN). In particular, we introduce the Residual-in-Residual Dense Block (RRDB) without batch normalization as the basic network building unit. Moreover, we borrow the idea from relativistic GAN to let the discriminator predict relative realness instead of the absolute value. Finally, we improve the perceptual loss by using the features before activation, which could provide stronger supervision for brightness consistency and texture recovery. Benefiting from these improvements, the proposed ESRGAN achieves consistently better visual quality with more realistic and natural textures than SRGAN and won the first place in the PIRM2018-SR Challenge. The code is available at https://github.com/xinntao/ESRGAN .
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| face-hallucination-on-ffhq-512-x-512-16x | ESRGAN | FID: 50.901 LPIPS: 0.3928 NIQE: 15.383 |
| image-super-resolution-on-bsd100-4x-upscaling | SRGAN + Residual-in-Residual Dense Block | PSNR: 27.85 SSIM: 0.7455 |
| image-super-resolution-on-ffhq-1024-x-1024-4x | ESRGAN | FID: 72.73 MS-SSIM: 0.782 PSNR: 19.84 SSIM: 0.353 |
| image-super-resolution-on-ffhq-256-x-256-4x | ESRGAN | FID: 166.36 MS-SSIM: 0.747 PSNR: 15.43 SSIM: 0.267 |
| image-super-resolution-on-ffhq-512-x-512-4x | ESRGAN | FED: 0.1107 FID: 3.503 LLE: 2.261 LPIPS: 0.1221 MS-SSIM: 0.935 NIQE: 6.984 PSNR: 27.134 SSIM: 0.741 |
| image-super-resolution-on-manga109-4x | bicubic | PSNR: 24.89 SSIM: 0.7866 |
| image-super-resolution-on-manga109-4x | SRGAN + Residual-in-Residual Dense Block | PSNR: 31.66 SSIM: 0.9196 |
| image-super-resolution-on-pirm-test | ESRGAN | NIQE: 2.55 |
| image-super-resolution-on-set14-4x-upscaling | SRGAN + Residual-in-Residual Dense Block | PSNR: 28.99 SSIM: 0.7917 |
| image-super-resolution-on-urban100-4x | SRGAN + Residual-in-Residual Dense Block | PSNR: 27.03 SSIM: 0.8153 |
| image-super-resolution-on-urban100-4x | bicubic | PSNR: 23.14 SSIM: 0.6577 |
| video-super-resolution-on-msu-video-upscalers | ESRGAN | PSNR: 27.29 SSIM: 0.936 VMAF: 56.69 |
| video-super-resolution-on-msu-vsr-benchmark | ESRGAN | 1 - LPIPS: 0.948 ERQAv1.0: 0.735 FPS: 1.004 PSNR: 27.33 QRCRv1.0: 0 SSIM: 0.808 Subjective score: 5.353 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.