HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

ProContEXT: Exploring Progressive Context Transformer for Tracking

Jin-Peng Lan; Zhi-Qi Cheng; Jun-Yan He; Chenyang Li; Bin Luo; Xu Bao; Wangmeng Xiang; Yifeng Geng; Xuansong Xie

ProContEXT: Exploring Progressive Context Transformer for Tracking

Abstract

Existing Visual Object Tracking (VOT) only takes the target area in the first frame as a template. This causes tracking to inevitably fail in fast-changing and crowded scenes, as it cannot account for changes in object appearance between frames. To this end, we revamped the tracking framework with Progressive Context Encoding Transformer Tracker (ProContEXT), which coherently exploits spatial and temporal contexts to predict object motion trajectories. Specifically, ProContEXT leverages a context-aware self-attention module to encode the spatial and temporal context, refining and updating the multi-scale static and dynamic templates to progressively perform accurately tracking. It explores the complementary between spatial and temporal context, raising a new pathway to multi-context modeling for transformer-based trackers. In addition, ProContEXT revised the token pruning technique to reduce computational complexity. Extensive experiments on popular benchmark datasets such as GOT-10k and TrackingNet demonstrate that the proposed ProContEXT achieves state-of-the-art performance.

Code Repositories

zhiqic/procontext
Official
pytorch
Mentioned in GitHub
jp-lan/procontext
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
video-object-tracking-on-nv-vot211ProContEXT
AUC: 40.10
Precision: 54.50

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
ProContEXT: Exploring Progressive Context Transformer for Tracking | Papers | HyperAI