Objects do not disappear: Video object detection by single-frame object location anticipation

Xin Liu; Fatemeh Karimi Nejadasl; Jan C. van Gemert; Olaf Booij; Silvia L. Pintea

Abstract

Objects in videos are typically characterized by continuous smooth motion. We exploit this property in three ways. 1) Improved accuracy: we use object motion as an additional source of supervision, obtained by anticipating object locations from a static keyframe. 2) Improved efficiency: because neighboring video frames are often redundant, we run the expensive feature computation only on a single static keyframe and predict object locations in the subsequent frames. 3) Reduced annotation cost: we annotate only the keyframes and use smooth pseudo-motion between them. We demonstrate computational efficiency, annotation efficiency, and improved mean average precision compared to the state of the art on four datasets: ImageNet VID, EPIC KITCHENS-55, YouTube-BoundingBoxes, and the Waymo Open dataset. Our source code is available at https://github.com/L-KID/Videoobject-detection-by-location-anticipation.
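As an illustration of the third point, pseudo-motion between two annotated keyframes can be sketched as a smooth interpolation of box coordinates. The abstract does not say how the pseudo-motion is generated, so the linear interpolation below is an assumption, and the names `Box` and `interpolate_boxes` are hypothetical rather than taken from the authors' code.

```python
from typing import List, Tuple

# A box is (x1, y1, x2, y2) in pixel coordinates (assumed convention).
Box = Tuple[float, float, float, float]


def interpolate_boxes(box_start: Box, box_end: Box, num_frames: int) -> List[Box]:
    """Linearly interpolate a box over num_frames frames.

    The first and last returned boxes are the two annotated keyframe boxes;
    the boxes in between are pseudo-labels for the unannotated frames.
    Linear interpolation is an assumption, not the paper's stated method.
    """
    if num_frames < 2:
        raise ValueError("num_frames must be at least 2 (the two keyframes)")

    boxes: List[Box] = []
    for t in range(num_frames):
        # alpha goes from 0.0 at the first keyframe to 1.0 at the second.
        alpha = t / (num_frames - 1)
        box = tuple(
            (1.0 - alpha) * start + alpha * end
            for start, end in zip(box_start, box_end)
        )
        boxes.append(box)  # type: ignore[arg-type]
    return boxes


# Example: two keyframes five frames apart, object drifting right and down.
pseudo_labels = interpolate_boxes((0.0, 0.0, 10.0, 10.0), (4.0, 8.0, 14.0, 18.0), 5)
```

Under this sketch only the two keyframes need manual annotation; the three intermediate boxes are derived, which is the sense in which annotation cost drops.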

Benchmarks

| Benchmark | Methodology | Metrics |
|---|---|---|
| video-object-detection-on-epic-kitchens-55 | Ours (Faster RCNN) | mAP@.5: 41.7 |
| video-object-detection-on-imagenet-vid | Ours (Faster RCNN + R101) | MAP: 87.2 |
| video-object-detection-on-imagenet-vid | Ours (Def. DETR + SwinB) | MAP: 91.3 |
| video-object-detection-on-imagenet-vid | Ours (Def. DETR + R101) | MAP: 87.9 |
| video-object-detection-on-waymo-open-dataset | - | AP: 59.28 |
| video-object-detection-on-yt-bb | - | mAP: 59.8 |
