HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

What Do Single-view 3D Reconstruction Networks Learn?

Maxim Tatarchenko; Stephan R. Richter; René Ranftl; Zhuwen Li; Vladlen Koltun; Thomas Brox

What Do Single-view 3D Reconstruction Networks Learn?

Abstract

Convolutional networks for single-view object reconstruction have shown impressive performance and have become a popular subject of research. All existing techniques are united by the idea of having an encoder-decoder network that performs non-trivial reasoning about the 3D structure of the output space. In this work, we set up two alternative approaches that perform image classification and retrieval respectively. These simple baselines yield better results than state-of-the-art methods, both qualitatively and quantitatively. We show that encoder-decoder methods are statistically indistinguishable from these baselines, thus indicating that the current state of the art in single-view object reconstruction does not actually perform reconstruction but image classification. We identify aspects of popular experimental procedures that elicit this behavior and discuss ways to improve the current state of research.

Benchmarks

BenchmarkMethodologyMetrics
3d-reconstruction-on-300wResNet
1-of-100 Accuracy: cosine loss

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
What Do Single-view 3D Reconstruction Networks Learn? | Papers | HyperAI