5 months ago

GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection

Kumar Abhinav ; Brazil Garrick ; Liu Xiaoming

Abstract

Modern 3D object detectors have immensely benefited from the end-to-endlearning idea. However, most of them use a post-processing algorithm calledNon-Maximal Suppression (NMS) only during inference. While there were attemptsto include NMS in the training pipeline for tasks such as 2D object detection,they have been less widely adopted due to a non-mathematical expression of theNMS. In this paper, we present and integrate GrooMeD-NMS -- a novel GroupedMathematically Differentiable NMS for monocular 3D object detection, such thatthe network is trained end-to-end with a loss on the boxes after NMS. We firstformulate NMS as a matrix operation and then group and mask the boxes in anunsupervised manner to obtain a simple closed-form expression of the NMS.GrooMeD-NMS addresses the mismatch between training and inference pipelinesand, therefore, forces the network to select the best 3D box in adifferentiable manner. As a result, GrooMeD-NMS achieves state-of-the-artmonocular 3D object detection results on the KITTI benchmark dataset performingcomparably to monocular video-based methods. Code and models athttps://github.com/abhi1kumar/groomed_nms

Code Repositories

abhi1kumar/groomed_nms

Official

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
3d-object-detection-from-monocular-images-on-7	GrooMeD-NMS	AP25: 16.12 AP50: 0.17
monocular-3d-object-detection-on-kitti-cars	GrooMeD-NMS	AP Medium: 12.32

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette