HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

Botao Ye; Hong Chang; Bingpeng Ma; Shiguang Shan; Xilin Chen

Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

Abstract

The current popular two-stream, two-stage tracking framework extracts the template and the search region features separately and then performs relation modeling, thus the extracted features lack the awareness of the target and have limited target-background discriminability. To tackle the above issue, we propose a novel one-stream tracking (OSTrack) framework that unifies feature learning and relation modeling by bridging the template-search image pairs with bidirectional information flows. In this way, discriminative target-oriented features can be dynamically extracted by mutual guidance. Since no extra heavy relation modeling module is needed and the implementation is highly parallelized, the proposed tracker runs at a fast speed. To further improve the inference efficiency, an in-network candidate early elimination module is proposed based on the strong similarity prior calculated in the one-stream framework. As a unified framework, OSTrack achieves state-of-the-art performance on multiple benchmarks, in particular, it shows impressive results on the one-shot tracking benchmark GOT-10k, i.e., achieving 73.7% AO, improving the existing best result (SwinTrack) by 4.3\%. Besides, our method maintains a good performance-speed trade-off and shows faster convergence. The code and models are available at https://github.com/botaoye/OSTrack.

Code Repositories

botaoye/ostrack
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
object-tracking-on-coesotOSTrack
Precision Rate: 66.6
Success Rate: 59.0
video-object-tracking-on-nv-vot211OSTrack-384
AUC: 38.59
Precision: 53.06
visual-object-tracking-on-got-10kOSTrack-384
Average Overlap: 73.7
Success Rate 0.5: 83.2
Success Rate 0.75: 70.8
visual-object-tracking-on-lasotOSTrack-384
AUC: 71.1
Normalized Precision: 81.1
Precision: 77.6
visual-object-tracking-on-lasot-extOSTrack
AUC: 50.6
Normalized Precision: 61.3
Precision: 57.6
visual-object-tracking-on-trackingnetOSTrack-384
Accuracy: 83.9
Normalized Precision: 88.5
Precision: 83.2
visual-object-tracking-on-uav123OSTrack -384
AUC: 0.707
visual-tracking-on-tnl2kOSTrack
AUC: 55.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Papers | HyperAI