Visual Object Tracking On Got 10K

评估指标

Average Overlap
Success Rate 0.5
Success Rate 0.75

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
SAMURAI-L81.792.276.9SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory
DAM4SAM81.1--A Distractor-Aware Memory for Visual Object Tracking with SAM2
MITS80.489.875.8Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
MCITrack-L38480.088.580.2Exploring Enhanced Contextual Information for Video-Level Object Tracking
ARTrackV2-L79.587.879.6ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe
LoRAT-g-37878.987.880.7Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
ARTrack-L78.587.477.8Autoregressive Visual Tracking-
ODTrack-L78.2--ODTrack: Online Dense Temporal Token Learning for Visual Tracking
MCITrack-B22477.988.276.8Exploring Enhanced Contextual Information for Video-Level Object Tracking
RTracker-L77.98776.9RTracker: Recoverable Tracking via PN Tree Structured Memory
LoRAT-L-37877.586.278.1Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
HIPTrack77.488.074.5HIPTrack: Visual Tracking with Historical Prompts
ODTrack-B77.0--ODTrack: Online Dense Temporal Token Learning for Visual Tracking
TATrack-L-GOT76.685.773.4Target-Aware Tracking with Long-term Context Attention
DropMAE75.986.872DropMAE: Learning Representations via Masked Autoencoders with Spatial-Attention Dropout for Temporal Matching Tasks
NeighborTrack-OSTrack75.785.7273.3NeighborTrack: Improving Single Object Tracking by Bipartite Matching with Neighbor Tracklets
MixViT-L(ConvMAE)75.785.375.1MixFormer: End-to-End Tracking with Iterative Mixed Attention
MixFormer-L75.685.7372.8MixFormer: End-to-End Tracking with Iterative Mixed Attention
SeqTrack-L38474.881.972.2Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
OSTrack-38473.783.270.8Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
0 of 39 row(s) selected.
Visual Object Tracking On Got 10K | SOTA | HyperAI超神经