HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Atlas: End-to-End 3D Scene Reconstruction from Posed Images

Zak Murez Tarrence van As James Bartolozzi Ayan Sinha Vijay Badrinarayanan Andrew Rabinovich

Atlas: End-to-End 3D Scene Reconstruction from Posed Images

Abstract

We present an end-to-end 3D reconstruction method for a scene by directly regressing a truncated signed distance function (TSDF) from a set of posed RGB images. Traditional approaches to 3D reconstruction rely on an intermediate representation of depth maps prior to estimating a full 3D model of a scene. We hypothesize that a direct regression to 3D is more effective. A 2D CNN extracts features from each image independently which are then back-projected and accumulated into a voxel volume using the camera intrinsics and extrinsics. After accumulation, a 3D CNN refines the accumulated features and predicts the TSDF values. Additionally, semantic segmentation of the 3D model is obtained without significant computation. This approach is evaluated on the Scannet dataset where we significantly outperform state-of-the-art baselines (deep multiview stereo followed by traditional TSDF fusion) both quantitatively and qualitatively. We compare our 3D semantic segmentation to prior methods that use a depth sensor since no previous work attempts the problem with only RGB input.

Code Repositories

magicleap/Atlas
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-reconstruction-on-scannetAtlas (finetuned)
3DIoU: 89.4
Chamfer Distance: 37.2
L1: 21.1
depth-estimation-on-scannetAtlas (finetuned)
RMSE: 0.174
absolute relative error: 0.089
depth-estimation-on-scannetAtlas (plain)
RMSE: 0.165

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Atlas: End-to-End 3D Scene Reconstruction from Posed Images | Papers | HyperAI