HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Wang Shihao ; Liu Yingfei ; Wang Tiancai ; Li Ying ; Zhang Xiangyu

Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D
  Object Detection

Abstract

In this paper, we propose a long-sequence modeling framework, namedStreamPETR, for multi-view 3D object detection. Built upon the sparse querydesign in the PETR series, we systematically develop an object-centric temporalmechanism. The model is performed in an online manner and the long-termhistorical information is propagated through object queries frame by frame.Besides, we introduce a motion-aware layer normalization to model the movementof the objects. StreamPETR achieves significant performance improvements onlywith negligible computation cost, compared to the single-frame baseline. On thestandard nuScenes benchmark, it is the first online multi-view method thatachieves comparable performance (67.6% NDS & 65.3% AMOTA) with lidar-basedmethods. The lightweight version realizes 45.0% mAP and 31.7 FPS, outperformingthe state-of-the-art method (SOLOFusion) by 2.3% mAP and 1.8x faster FPS. Codehas been available at https://github.com/exiawsh/StreamPETR.git.

Code Repositories

exiawsh/streampetr
Official
pytorch
Mentioned in GitHub
wenyuqing/panacea
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-multi-object-tracking-on-nuscenes-camera-1StreamPETR-Large
AMOTA: 65.3
3d-object-detection-on-3d-object-detection-onStreamPETR
Average mAP: 20.3
3d-object-detection-on-nuscenes-camera-onlyStreamPETR-Large
Future Frame: false
NDS: 67.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Papers | HyperAI