HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Yikang Zhou; Tao Zhang; Shunping Ji; Shuicheng Yan; Xiangtai Li

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Abstract

Modern video segmentation methods adopt object queries to perform inter-frame association and demonstrate satisfactory performance in tracking continuously appearing objects despite large-scale motion and transient occlusion. However, they all underperform on newly emerging and disappearing objects that are common in the real world because they attempt to model object emergence and disappearance through feature transitions between background and foreground queries that have significant feature gaps. We introduce Dynamic Anchor Queries (DAQ) to shorten the transition gap between the anchor and target queries by dynamically generating anchor queries based on the features of potential candidates. Furthermore, we introduce a query-level object Emergence and Disappearance Simulation (EDS) strategy, which unleashes DAQ's potential without any additional cost. Finally, we combine our proposed DAQ and EDS with DVIS to obtain DVIS-DAQ. Extensive experiments demonstrate that DVIS-DAQ achieves a new state-of-the-art (SOTA) performance on five mainstream video segmentation benchmarks. Code and models are available at \url{https://github.com/SkyworkAI/DAQ-VS}.

Code Repositories

zhang-tao-whu/DVIS_Plus
pytorch
Mentioned in GitHub
zhang-tao-whu/DVIS
pytorch
Mentioned in GitHub
skyworkai/daq-vs
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
video-instance-segmentation-on-ovis-1DVIS-DAQ(VIT-L, Offline)
AP50: 83.8
AP75: 62.9
mask AP: 57.1
video-instance-segmentation-on-youtube-vis-2DVIS-DAQ(VIT-L, Offline)
AP50: 86.1
AP75: 72.2
AR1: 49.6
AR10: 70.7
mask AP: 64.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries | Papers | HyperAI