HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

Yinhao Li Zheng Ge Guanyi Yu Jinrong Yang Zengran Wang Yukang Shi Jianjian Sun Zeming Li

BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

Abstract

In this research, we propose a new 3D object detector with a trustworthy depth estimation, dubbed BEVDepth, for camera-based Bird's-Eye-View (BEV) 3D object detection. Our work is based on a key observation -- depth estimation in recent approaches is surprisingly inadequate given the fact that depth is essential to camera 3D detection. Our BEVDepth resolves this by leveraging explicit depth supervision. A camera-awareness depth estimation module is also introduced to facilitate the depth predicting capability. Besides, we design a novel Depth Refinement Module to counter the side effects carried by imprecise feature unprojection. Aided by customized Efficient Voxel Pooling and multi-frame mechanism, BEVDepth achieves the new state-of-the-art 60.9% NDS on the challenging nuScenes test set while maintaining high efficiency. For the first time, the NDS score of a camera model reaches 60%.

Code Repositories

megvii-basedetection/bevdepth
Official
pytorch
Mentioned in GitHub
ZRandomize/MatrixVT
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-object-detection-on-dair-v2x-iBEVDepth
AP|R40(easy): 75.7
AP|R40(hard): 63.7
AP|R40(moderate): 63.6
3d-object-detection-on-nuscenes-camera-onlyBEVDepth-pure
Future Frame: false
NDS: 60.9
3d-object-detection-on-rope3dBEVDepth
AP@0.7: 42.56

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection | Papers | HyperAI