HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation

Andong Lu; Jiacong Zhao; Chenglong Li; Yun Xiao; Bin Luo

Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation

Abstract

Modality gap between RGB and thermal infrared (TIR) images is a crucial issue but often overlooked in existing RGBT tracking methods. It can be observed that modality gap mainly lies in the image style difference. In this work, we propose a novel Coupled Knowledge Distillation framework called CKD, which pursues common styles of different modalities to break modality gap, for high performance RGBT tracking. In particular, we introduce two student networks and employ the style distillation loss to make their style features consistent as much as possible. Through alleviating the style difference of two student networks, we can break modality gap of different modalities well. However, the distillation of style features might harm to the content representations of two modalities in student networks. To handle this issue, we take original RGB and TIR networks as the teachers, and distill their content knowledge into two student networks respectively by the style-content orthogonal feature decoupling scheme. We couple the above two distillation processes in an online optimization framework to form new feature representations of RGB and thermal modalities without modality gap. In addition, we design a masked modeling strategy and a multi-modal candidate token elimination strategy into CKD to improve tracking robustness and efficiency respectively. Extensive experiments on five standard RGBT tracking datasets validate the effectiveness of the proposed method against state-of-the-art methods while achieving the fastest tracking speed of 96.4 FPS. Code available at https://github.com/Multi-Modality-Tracking/CKD.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
rgb-t-tracking-on-lasherCKD
Precision: 73.2
Success: 58.1
rgb-t-tracking-on-rgbt210CKD
Precision: 88.4
Success: 65.2
rgb-t-tracking-on-rgbt234CKD
Precision: 90.0
Success: 67.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation | Papers | HyperAI