HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

Wang Yan ; Chao Wei-Lun ; Garg Divyansh ; Hariharan Bharath ; Campbell Mark ; Weinberger Kilian Q.

Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object
  Detection for Autonomous Driving

Abstract

3D object detection is an essential task in autonomous driving. Recenttechniques excel with highly accurate detection rates, provided the 3D inputdata is obtained from precise but expensive LiDAR technology. Approaches basedon cheaper monocular or stereo imagery data have, until now, resulted indrastically lower accuracies --- a gap that is commonly attributed to poorimage-based depth estimation. However, in this paper we argue that it is notthe quality of the data but its representation that accounts for the majorityof the difference. Taking the inner workings of convolutional neural networksinto consideration, we propose to convert image-based depth maps topseudo-LiDAR representations --- essentially mimicking the LiDAR signal. Withthis representation we can apply different existing LiDAR-based detectionalgorithms. On the popular KITTI benchmark, our approach achieves impressiveimprovements over the existing state-of-the-art in image-based performance ---raising the detection accuracy of objects within the 30m range from theprevious state-of-the-art of 22% to an unprecedented 74%. At the time ofsubmission our algorithm holds the highest entry on the KITTI 3D objectdetection leaderboard for stereo-image-based approaches. Our code is publiclyavailable at https://github.com/mileyan/pseudo_lidar.

Code Repositories

haosulab/ManiSkill-Learn
pytorch
Mentioned in GitHub
mileyan/pseudo_lidar
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-object-detection-from-stereo-images-on-1Pseudo-LiDAR
AP75: 34.05

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving | Papers | HyperAI