HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Kinematic 3D Object Detection in Monocular Video

Garrick Brazil; Gerard Pons-Moll; Xiaoming Liu; Bernt Schiele

Kinematic 3D Object Detection in Monocular Video

Abstract

Perceiving the physical world in 3D is fundamental for self-driving applications. Although temporal motion is an invaluable resource to human vision for detection, tracking, and depth perception, such features have not been thoroughly utilized in modern 3D object detectors. In this work, we propose a novel method for monocular video-based 3D object detection which carefully leverages kinematic motion to improve precision of 3D localization. Specifically, we first propose a novel decomposition of object orientation as well as a self-balancing 3D confidence. We show that both components are critical to enable our kinematic model to work effectively. Collectively, using only a single model, we efficiently leverage 3D kinematics from monocular videos to improve the overall localization precision in 3D object detection while also producing useful by-products of scene dynamics (ego-motion and per-object velocity). We achieve state-of-the-art performance on monocular 3D object detection and the Bird's Eye View tasks within the KITTI self-driving dataset.

Code Repositories

garrickbrazil/kinematic3d
pytorch
Mentioned in GitHub
Nicholasli1995/EgoNet
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-object-detection-on-rope3dKinematic3D+(G)
AP@0.7: 17.74
monocular-3d-object-detection-on-kitti-carsKinematic3D
AP Medium: 12.72
vehicle-pose-estimation-on-kitti-cars-hardKinematic3D
Average Orientation Similarity: 34.81

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Kinematic 3D Object Detection in Monocular Video | Papers | HyperAI