Command Palette
Search for a command to run...
Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
Sun Jiaming ; Chen Linghao ; Xie Yiming ; Zhang Siyu ; Jiang Qinhong ; Zhou Xiaowei ; Bao Hujun

Abstract
In this paper, we propose a novel system named Disp R-CNN for 3D objectdetection from stereo images. Many recent works solve this problem by firstrecovering a point cloud with disparity estimation and then apply a 3Ddetector. The disparity map is computed for the entire image, which is costlyand fails to leverage category-specific prior. In contrast, we design aninstance disparity estimation network (iDispNet) that predicts disparity onlyfor pixels on objects of interest and learns a category-specific shape priorfor more accurate disparity estimation. To address the challenge from scarcityof disparity annotation in training, we propose to use a statistical shapemodel to generate dense disparity pseudo-ground-truth without the need of LiDARpoint clouds, which makes our system more widely applicable. Experiments on theKITTI dataset show that, even when LiDAR ground-truth is not available attraining time, Disp R-CNN achieves competitive performance and outperformsprevious state-of-the-art methods by 20% in terms of average precision.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-object-detection-from-stereo-images-on-1 | Disp R-CNN | AP75: 45.78 |
| 3d-object-detection-from-stereo-images-on-2 | Disp R-CNN | AP50: 25.80 |
| 3d-object-detection-from-stereo-images-on-3 | Disp R-CNN | AP50: 24.40 |
| vehicle-pose-estimation-on-kitti-cars-hard | Disp-RCNN (Stereo) | Average Orientation Similarity: 67.16 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.