Learning to Fuse Asymmetric Feature Maps in Siamese Trackers

Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen

Abstract

Recently, Siamese-based trackers have achieved promising performance in visual tracking. Most recent Siamese-based trackers typically employ a depth-wise cross-correlation (DW-XCorr) to obtain multi-channel correlation information from the two feature maps (target and search region). However, DW-XCorr has several limitations within Siamese-based tracking: it can easily be fooled by distractors, has fewer activated channels, and provides weak discrimination of object boundaries. Further, DW-XCorr is a handcrafted parameter-free module and cannot fully benefit from offline learning on large-scale data. We propose a learnable module, called the asymmetric convolution (ACM), which learns to better capture the semantic correlation information in offline training on large-scale data. Different from DW-XCorr and its predecessor (XCorr), which regard a single feature map as the convolution kernel, our ACM decomposes the convolution operation on a concatenated feature map into two mathematically equivalent operations, thereby avoiding the need for the feature maps to be of the same size (width and height) during concatenation. Our ACM can incorporate useful prior information, such as bounding-box size, with standard visual features. Furthermore, ACM can easily be integrated into existing Siamese trackers based on DW-XCorr or XCorr. To demonstrate its generalization ability, we integrate ACM into three representative trackers: SiamFC, SiamRPN++, and SiamBAN. Our experiments reveal the benefits of the proposed ACM, which outperforms existing methods on six tracking benchmarks. On the LaSOT test set, our ACM-based tracker obtains a significant improvement of 5.8% in terms of success (AUC), over the baseline.
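
The decomposition described in the abstract rests on the linearity of convolution over concatenated channels. The sketch below illustrates that argument in plain Python; it is not the authors' implementation. It assumes a single output channel, stride 1, no padding, no bias and no nonlinearity, and the names `dw_xcorr`, `fuse_by_concat`, `fuse_asymmetric`, `w_z` and `w_x` are hypothetical, chosen only for this example. `dw_xcorr` shows the parameter-free baseline, where each target channel acts as the kernel for the matching search channel. The two `fuse_*` functions compare the naive concatenate-then-convolve fusion with its decomposed form.

```python
import random


def window_dot(kernel, fmap, top, left):
    """Dot product of a [C][kh][kw] kernel with the fmap window at (top, left)."""
    return sum(
        kernel[c][i][j] * fmap[c][top + i][left + j]
        for c in range(len(kernel))
        for i in range(len(kernel[0]))
        for j in range(len(kernel[0][0]))
    )


def output_size(target, search):
    """Number of valid (stride-1, unpadded) window positions."""
    out_h = len(search[0]) - len(target[0]) + 1
    out_w = len(search[0][0]) - len(target[0][0]) + 1
    return out_h, out_w


def dw_xcorr(target, search):
    """Depth-wise cross-correlation: target channel c is the kernel for search channel c."""
    out_h, out_w = output_size(target, search)
    return [
        [[window_dot([target[c]], [search[c]], y, x) for x in range(out_w)]
         for y in range(out_h)]
        for c in range(len(target))
    ]


def fuse_by_concat(target, search, w_z, w_x):
    """Naive fusion: concatenate the target with every target-sized search window."""
    out_h, out_w = output_size(target, search)
    kh, kw = len(target[0]), len(target[0][0])
    kernel = w_z + w_x  # one kernel over the 2C concatenated channels
    out = []
    for y in range(out_h):
        row = []
        for x in range(out_w):
            window = [[r[x:x + kw] for r in ch[y:y + kh]] for ch in search]
            row.append(window_dot(kernel, target + window, 0, 0))
        out.append(row)
    return out


def fuse_asymmetric(target, search, w_z, w_x):
    """Decomposed fusion: target response computed once, then broadcast-added."""
    out_h, out_w = output_size(target, search)
    target_term = window_dot(w_z, target, 0, 0)  # single scalar for the whole map
    return [[window_dot(w_x, search, y, x) + target_term for x in range(out_w)]
            for y in range(out_h)]


def random_map(rng, channels, height, width):
    """Integer-valued [C][H][W] map, so the equality check below is exact."""
    return [[[rng.randint(-3, 3) for _ in range(width)] for _ in range(height)]
            for _ in range(channels)]


rng = random.Random(0)
target = random_map(rng, 4, 3, 3)  # stand-in for target features
search = random_map(rng, 4, 7, 7)  # stand-in for search-region features
w_z = random_map(rng, 4, 3, 3)     # kernel half applied to the target
w_x = random_map(rng, 4, 3, 3)     # kernel half applied to the search region

naive = fuse_by_concat(target, search, w_z, w_x)
decomposed = fuse_asymmetric(target, search, w_z, w_x)
print(naive == decomposed)
```

In this sketch the decomposed form never crops or concatenates windows, so the two inputs do not have to share a spatial size: the target branch collapses to one value that is added at every search position. Unlike `dw_xcorr`, both branches carry weights (`w_z`, `w_x`) that could be learned offline, which is the property the abstract contrasts with the parameter-free DW-XCorr.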

Code Repositories

wencheng256/SiamBAN-ACM
Official
Mentioned in GitHub

Benchmarks

Benchmark | Methodology | Metrics
video-object-tracking-on-nv-vot211 | SiamBAN-ACM | AUC: 35.80; Precision: 48.31
visual-object-tracking-on-lasot | SiamBAN-ACM | AUC: 57.2; Normalized Precision: 65.3; Precision: 58.7
visual-object-tracking-on-trackingnet | SiamBAN-ACM | Accuracy: 75.3; Normalized Precision: 81.0; Precision: 71.2
