HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Learning Discriminative Model Prediction for Tracking

Goutam Bhat; Martin Danelljan; Luc Van Gool; Radu Timofte

Learning Discriminative Model Prediction for Tracking

Abstract

The current strive towards end-to-end trainable computer vision systems imposes major challenges for the task of visual tracking. In contrast to most other vision problems, tracking requires the learning of a robust target-specific appearance model online, during the inference stage. To be end-to-end trainable, the online learning of the target model thus needs to be embedded in the tracking architecture itself. Due to the imposed challenges, the popular Siamese paradigm simply predicts a target feature template, while ignoring the background appearance information during inference. Consequently, the predicted model possesses limited target-background discriminability. We develop an end-to-end tracking architecture, capable of fully exploiting both target and background appearance information for target model prediction. Our architecture is derived from a discriminative learning loss by designing a dedicated optimization process that is capable of predicting a powerful model in only a few iterations. Furthermore, our approach is able to learn key aspects of the discriminative loss itself. The proposed tracker sets a new state-of-the-art on 6 tracking benchmarks, achieving an EAO score of 0.440 on VOT2018, while running at over 40 FPS. The code and models are available at https://github.com/visionml/pytracking.

Code Repositories

visionml/pytracking
Official
pytorch
Mentioned in GitHub
martin-danelljan/ECO
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
object-tracking-on-fe108DiMP
Averaged Precision: 85.1
Success Rate: 57.1
video-object-tracking-on-nv-vot211DiMP-50
AUC: 35.89
Precision: 48.68
visual-object-tracking-on-got-10kDiMP
Average Overlap: 61.1
Success Rate 0.5: 71.7
visual-object-tracking-on-lasotDiMP-50
Precision: 68.7
visual-object-tracking-on-lasotDiMP
AUC: 56.8
Normalized Precision: 65.0
Precision: 56.7
visual-object-tracking-on-trackingnetDiMP-50
Accuracy: 74.0
Normalized Precision: 80.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Learning Discriminative Model Prediction for Tracking | Papers | HyperAI