a month ago

A Distractor-Aware Memory for Visual Object Tracking with SAM2

Jovana Videnovic, Alan Lukezic, Matej Kristan

Abstract

Memory-based trackers are video object segmentation methods that form the target model by concatenating recently tracked frames into a memory buffer and localize the target by attending the current image to the buffered frames. Although such trackers already achieved top performance on many benchmarks, it was the recent release of SAM2 that brought them into the focus of the visual object tracking community. Nevertheless, modern trackers still struggle in the presence of distractors. We argue that a more sophisticated memory model is required, and propose a new distractor-aware memory model for SAM2 and an introspection-based update strategy that jointly addresses segmentation accuracy and tracking robustness. The resulting tracker is denoted as SAM2.1++. We also propose a new distractor-distilled dataset, DiDi, to better study the distractor problem. SAM2.1++ outperforms SAM2.1 and related SAM memory extensions on seven benchmarks and sets a solid new state-of-the-art on six of them.
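
The generic memory-based tracking scheme described in the first sentence of the abstract can be illustrated with a toy sketch. This is not the SAM2 or SAM2.1++ implementation: the class FrameMemory, the function attend, the feature vectors and the buffer capacity are all hypothetical, chosen only to show a FIFO buffer of recent frames whose contents are concatenated and read by scaled dot-product cross-attention. The distractor-aware memory and the introspection-based update are not modeled here, since the abstract does not specify them.

```python
import math
from collections import deque


class FrameMemory:
    """Toy FIFO memory (hypothetical): keeps features of the most recent frames."""

    def __init__(self, capacity):
        # deque with maxlen drops the oldest frame automatically when full
        self.frames = deque(maxlen=capacity)

    def add(self, keys, values):
        # keys: list of feature vectors; values: one target score per location
        self.frames.append((keys, values))

    def concatenated(self):
        # Concatenate all buffered frames into one key list and one value list
        keys = [k for frame_keys, _ in self.frames for k in frame_keys]
        values = [v for _, frame_values in self.frames for v in frame_values]
        return keys, values


def attend(queries, keys, values):
    """Scaled dot-product cross-attention from current-frame queries to memory."""
    scale = 1.0 / math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        logits = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
        top = max(logits)  # subtract the max for a numerically stable softmax
        weights = [math.exp(l - top) for l in logits]
        total = sum(weights)
        out.append(sum(w * v for w, v in zip(weights, values)) / total)
    return out


# Two buffered frames, each with a target location (score 1.0)
# and a background location (score 0.0).
memory = FrameMemory(capacity=2)
memory.add(keys=[[4.0, 0.0], [0.0, 4.0]], values=[1.0, 0.0])
memory.add(keys=[[4.0, 0.0], [0.0, 4.0]], values=[1.0, 0.0])

keys, values = memory.concatenated()
# Current frame: the first location resembles the target, the second the background.
scores = attend([[4.0, 0.0], [0.0, 4.0]], keys, values)
```

In this sketch the first query receives a score close to 1 and the second a score close to 0, because each query attends mostly to the memory locations with similar features. A distractor would correspond to a background location whose features resemble the target, which is the failure case the paper's memory model is designed to address.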

Code Repositories

jovanavidenovic/dam4sam (official, PyTorch, mentioned in GitHub)

Benchmarks

Benchmark                                        Methodology  Metrics
semi-supervised-video-object-segmentation-on-15  DAM4SAM      EAO: 0.729
visual-object-tracking-on-didi                   DAM4SAM      Tracking quality: 0.694
visual-object-tracking-on-got-10k                DAM4SAM      Average Overlap: 81.1
visual-object-tracking-on-lasot                  DAM4SAM      AUC: 75.1
visual-object-tracking-on-lasot-ext              DAM4SAM      AUC: 60.9
visual-object-tracking-on-vot2022                DAM4SAM      EAO: 0.753
