HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

RGB-T Tracking via Multi-Modal Mutual Prompt Learning

Yang Luo; Xiqing Guo; Hui Feng; Lei Ao

RGB-T Tracking via Multi-Modal Mutual Prompt Learning

Abstract

Object tracking based on the fusion of visible and thermal im-ages, known as RGB-T tracking, has gained increasing atten-tion from researchers in recent years. How to achieve a more comprehensive fusion of information from the two modalities with fewer computational costs has been a problem that re-searchers have been exploring. Recently, with the rise of prompt learning in computer vision, we can better transfer knowledge from visual large models to downstream tasks. Considering the strong complementarity between visible and thermal modalities, we propose a tracking architecture based on mutual prompt learning between the two modalities. We also design a lightweight prompter that incorporates attention mechanisms in two dimensions to transfer information from one modality to the other with lower computational costs, embedding it into each layer of the backbone. Extensive ex-periments have demonstrated that our proposed tracking ar-chitecture is effective and efficient, achieving state-of-the-art performance while maintaining high running speeds.

Code Repositories

husteryoung/mplt
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
rgb-t-tracking-on-lasherMPLT
Precision: 72.0
Success: 57.1
rgb-t-tracking-on-rgbt210MPLT
Precision: 86.2
Success: 63.0
rgb-t-tracking-on-rgbt234MPLT
Precision: 88.4
Success: 65.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
RGB-T Tracking via Multi-Modal Mutual Prompt Learning | Papers | HyperAI