Command Palette
Search for a command to run...
Frustum ConvNet: Sliding Frustums to Aggregate Local Point-Wise Features for Amodal 3D Object Detection
Zhixin Wang; Kui Jia

Abstract
In this work, we propose a novel method termed \emph{Frustum ConvNet (F-ConvNet)} for amodal 3D object detection from point clouds. Given 2D region proposals in an RGB image, our method first generates a sequence of frustums for each region proposal, and uses the obtained frustums to group local points. F-ConvNet aggregates point-wise features as frustum-level feature vectors, and arrays these feature vectors as a feature map for use of its subsequent component of fully convolutional network (FCN), which spatially fuses frustum-level features and supports an end-to-end and continuous estimation of oriented boxes in the 3D space. We also propose component variants of F-ConvNet, including an FCN variant that extracts multi-resolution frustum features, and a refined use of F-ConvNet over a reduced 3D space. Careful ablation studies verify the efficacy of these component variants. F-ConvNet assumes no prior knowledge of the working 3D environment and is thus dataset-agnostic. We present experiments on both the indoor SUN-RGBD and outdoor KITTI datasets. F-ConvNet outperforms all existing methods on SUN-RGBD, and at the time of submission it outperforms all published works on the KITTI benchmark. Code has been made available at: {\url{https://github.com/zhixinwang/frustum-convnet}.}
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-object-detection-on-kitti-cars-easy | F-ConvNet | AP: 85.88% |
| 3d-object-detection-on-kitti-cars-hard | F-ConvNet | AP: 68.08% |
| 3d-object-detection-on-kitti-cyclists | F-ConvNet | AP: 64.68% |
| 3d-object-detection-on-kitti-cyclists-easy | F-ConvNet | AP: 79.58% |
| 3d-object-detection-on-kitti-cyclists-hard | F-ConvNets | AP: 57.03% |
| 3d-object-detection-on-kitti-pedestrians | F-ConvNet | AP: 43.38% |
| 3d-object-detection-on-kitti-pedestrians-easy | F-ConvNet | AP: 52.37% |
| 3d-object-detection-on-kitti-pedestrians-hard | F-ConvNet | AP: 41.49% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.