HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

RPT: Learning Point Set Representation for Siamese Visual Tracking

Ziang Ma Linyuan Wang Haitao Zhang Wei Lu Jun Yin

RPT: Learning Point Set Representation for Siamese Visual Tracking

Abstract

While remarkable progress has been made in robust visual tracking, accurate target state estimation still remains a highly challenging problem. In this paper, we argue that this issue is closely related to the prevalent bounding box representation, which provides only a coarse spatial extent of object. Thus an effcient visual tracking framework is proposed to accurately estimate the target state with a finer representation as a set of representative points. The point set is trained to indicate the semantically and geometrically significant positions of target region, enabling more fine-grained localization and modeling of object appearance. We further propose a multi-level aggregation strategy to obtain detailed structure information by fusing hierarchical convolution layers. Extensive experiments on several challenging benchmarks including OTB2015, VOT2018, VOT2019 and GOT-10k demonstrate that our method achieves new state-of-the-art performance while running at over 20 FPS.

Benchmarks

BenchmarkMethodologyMetrics
semi-supervised-video-object-segmentation-on-15RPT
EAO: 0.530
EAO (real-time): 0.290

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
RPT: Learning Point Set Representation for Siamese Visual Tracking | Papers | HyperAI