Command Palette
Search for a command to run...
Sun Xingyuan ; Wu Jiajun ; Zhang Xiuming ; Zhang Zhoutong ; Zhang Chengkai ; Xue Tianfan ; Tenenbaum Joshua B. ; Freeman William T.

Abstract
We study 3D shape modeling from a single image and make contributions to itin three aspects. First, we present Pix3D, a large-scale benchmark of diverseimage-shape pairs with pixel-level 2D-3D alignment. Pix3D has wide applicationsin shape-related tasks including reconstruction, retrieval, viewpointestimation, etc. Building such a large-scale dataset, however, is highlychallenging; existing datasets either contain only synthetic data, or lackprecise alignment between 2D images and 3D shapes, or only have a small numberof images. Second, we calibrate the evaluation criteria for 3D shapereconstruction through behavioral studies, and use them to objectively andsystematically benchmark cutting-edge reconstruction algorithms on Pix3D.Third, we design a novel model that simultaneously performs 3D reconstructionand pose estimation; our multi-task learning approach achieves state-of-the-artperformance on both tasks.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-shape-reconstruction-on-pix3d | MarrNet extension (w/ Pose) | CD: 0.119 EMD: 0.118 IoU: 0.282 |
| 3d-shape-retrieval-on-pix3d | MarrNet extension (w/o Pose) | R@1: 0.53 R@16: 0.85 R@2: 0.62 R@32: 0.90 R@4: 0.71 R@8: 0.78 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.