HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors

Chen Yilun ; Huang Shijia ; Liu Shu ; Yu Bei ; Jia Jiaya

DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors

Abstract

Camera-based 3D object detectors are welcome due to their wider deploymentand lower price than LiDAR sensors. We first revisit the prior stereo detectorDSGN for its stereo volume construction ways for representing both 3D geometryand semantics. We polish the stereo modeling and propose the advanced version,DSGN++, aiming to enhance effective information flow throughout the 2D-to-3Dpipeline in three main aspects. First, to effectively lift the 2D informationto stereo volume, we propose depth-wise plane sweeping (DPS) that allows denserconnections and extracts depth-guided features. Second, for graspingdifferently spaced features, we present a novel stereo volume -- Dual-viewStereo Volume (DSV) that integrates front-view and top-view features andreconstructs sub-voxel depth in the camera frustum. Third, as the foregroundregion becomes less dominant in 3D space, we propose a multi-modal data editingstrategy -- Stereo-LiDAR Copy-Paste, which ensures cross-modal alignment andimproves data efficiency. Without bells and whistles, extensive experiments invarious modality setups on the popular KITTI benchmark show that our methodconsistently outperforms other camera-based 3D detectors for all categories.Code is available at https://github.com/chenyilun95/DSGN2.

Code Repositories

chenyilun95/dsgn2
Official
pytorch
Mentioned in GitHub

Benchmarks

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors | Papers | HyperAI