Stereo Magnification with Multi-Layer Images
Taras Khakhulin, Denis Korzhenkov, Pavel Solovev, Gleb Sterkin, Timotei Ardelean, Victor Lempitsky

Abstract
Representing scenes with multiple semi-transparent colored layers has been a popular and successful choice for real-time novel view synthesis. Existing approaches infer colors and transparency values over regularly spaced layers of planar or spherical shape. In this work, we introduce a new view synthesis approach based on multiple semi-transparent layers with scene-adapted geometry. Our approach infers such representations from stereo pairs in two stages. The first stage infers the geometry of a small number of data-adaptive layers from a given pair of views. The second stage infers the color and transparency values for these layers, producing the final representation for novel view synthesis. Importantly, both stages are connected through a differentiable renderer and are trained end-to-end. In our experiments, we demonstrate the advantage of the proposed approach over regularly spaced layers with no adaptation to scene geometry. Despite being orders of magnitude faster at rendering, our approach also outperforms the recently proposed IBRNet system, which is based on an implicit geometry representation. See results at https://samsunglabs.github.io/StereoLayers .
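The abstract's differentiable renderer composites the colored, semi-transparent layers into an image, which is what lets the geometry and color/transparency stages be trained end-to-end. Below is a minimal sketch of the standard back-to-front "over" compositing operator that such layered representations rely on; the function name, tensor shapes, and back-to-front layer ordering are illustrative assumptions, not the authors' implementation.

```python
import torch

def composite_layers(colors: torch.Tensor, alphas: torch.Tensor) -> torch.Tensor:
    """Back-to-front "over" compositing of semi-transparent layers.

    colors: (L, 3, H, W) per-layer RGB values, ordered back to front (assumed).
    alphas: (L, 1, H, W) per-layer transparency values in [0, 1].
    Returns a (3, H, W) composited image.
    """
    image = torch.zeros_like(colors[0])
    for rgb, alpha in zip(colors, alphas):
        # Each layer occludes what lies behind it in proportion to its alpha.
        image = rgb * alpha + image * (1.0 - alpha)
    return image

# Toy usage: composite four layers of a 2x2 image.
colors = torch.rand(4, 3, 2, 2)
alphas = torch.rand(4, 1, 2, 2)
novel_view = composite_layers(colors, alphas)
```

Because the whole operator is built from differentiable tensor arithmetic, gradients from an image-reconstruction loss can flow through it back into both inference stages, which is the property the end-to-end training described above requires.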
Benchmarks
| Benchmark | Methodology | LPIPS ↓ | PSNR ↑ | SSIM ↑ |
|---|---|---|---|---|
| novel-view-synthesis-on-sword | StereoLayers | 0.096 | 25.95 | 0.81 |
| novel-view-synthesis-on-sword | StereoLayers (8 layers) | 0.113 | 25.54 | 0.79 |
| novel-view-synthesis-on-sword | StereoLayers (2 layers) | 0.102 | 25.28 | 0.78 |