HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a Single RGB Panorama

Yang Shang-Ta ; Wang Fu-En ; Peng Chi-Han ; Wonka Peter ; Sun Min ; Chu Hung-Kuo

DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a
  Single RGB Panorama

Abstract

We present a deep learning framework, called DuLa-Net, to predictManhattan-world 3D room layouts from a single RGB panorama. To achieve betterprediction accuracy, our method leverages two projections of the panorama atonce, namely the equirectangular panorama-view and the perspectiveceiling-view, that each contains different clues about the room layouts. Ournetwork architecture consists of two encoder-decoder branches for analyzingeach of the two views. In addition, a novel feature fusion structure isproposed to connect the two branches, which are then jointly trained to predictthe 2D floor plans and layout heights. To learn more complex room layouts, weintroduce the Realtor360 dataset that contains panoramas of Manhattan-worldroom layouts with different numbers of corners. Experimental results show thatour work outperforms recent state-of-the-art in prediction accuracy andperformance, especially in the rooms with non-cuboid layouts.

Code Repositories

SunDaDenny/DuLa-Net
pytorch
Mentioned in GitHub

Benchmarks

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
DuLa-Net: A Dual-Projection Network for Estimating Room Layouts from a Single RGB Panorama | Papers | HyperAI