5 months ago

TrickVOS: A Bag of Tricks for Video Object Segmentation

Evangelos Skartados; Konstantinos Georgiadis; Mehmet Kerim Yucel; Koskinas Ioannis; Armando Domi; Anastasios Drosou; Bruno Manganelli; Albert Saa-Garriga


Abstract

Space-time memory (STM) network methods have been dominant in semi-supervised video object segmentation (SVOS) due to their remarkable performance. In this work, we identify three key aspects in which such methods can be improved: i) supervisory signal, ii) pretraining and iii) spatial awareness. We then propose TrickVOS, a generic, method-agnostic bag of tricks that addresses each aspect with i) a structure-aware hybrid loss, ii) a simple decoder pretraining regime and iii) a cheap tracker that imposes spatial constraints on model predictions. Finally, we propose a lightweight network and show that, when trained with TrickVOS, it achieves results competitive with state-of-the-art methods on the DAVIS and YouTube benchmarks, while being one of the first STM-based SVOS methods that can run in real time on a mobile device.

Benchmarks

Benchmark | Methodology | Metrics
semi-supervised-video-object-segmentation-on-18 | Lightweight TrickVOS (PT) | F-Measure (Seen): 83.3; F-Measure (Unseen): 84; Jaccard (Unseen): 75.2; J&F: 80.5; Jaccard (Seen): 79.5
semi-supervised-video-object-segmentation-on-18 | STCN + TrickVOS (PT) | F-Measure (Seen): 86.4; F-Measure (Unseen): 85.5; J&F: 82.8; Jaccard (Seen): 82.1; Jaccard (Unseen): 77.2
semi-supervised-video-object-segmentation-on-2 | Lightweight TrickVOS (PT) | F-measure (Mean): 86; J&F: 82.7; Jaccard (Mean): 79.4; Speed (FPS): 76.4
semi-supervised-video-object-segmentation-on-2 | STCN + TrickVOS (PT) | F-measure (Mean): 89.6; J&F: 86.1; Jaccard (Mean): 82.6; Speed (FPS): 35.1
semi-supervised-video-object-segmentation-on-3 | STCN + TrickVOS (PT) | Speed (FPS): 45.4
visual-object-tracking-on-davis-2016 | STCN + TrickVOS (PT) | F-measure (Mean): 93.1; J&F: 91.8; Jaccard (Mean): 90.5
visual-object-tracking-on-davis-2016 | Lightweight TrickVOS (PT) | F-measure (Mean): 89.9; J&F: 89.3; Jaccard (Mean): 88.7; Speed (FPS): 86.4
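
The J&F values listed above are consistent with the usual convention of averaging the region score (Jaccard, J) and the boundary score (F-measure, F). The short Python sketch below checks this against the listed numbers. It assumes that J&F is a plain arithmetic mean of the component scores, and that for the benchmark reporting seen and unseen splits it is the mean of all four values; the listing itself does not state either definition. The function name `j_and_f` is hypothetical and introduced only for illustration.

```python
from statistics import mean


def j_and_f(*scores: float) -> float:
    """Average the given J and F scores and round to one decimal place.

    Assumption: J&F is the arithmetic mean of its component scores
    (two values for mean-only benchmarks, four values when seen and
    unseen splits are reported separately).
    """
    return round(mean(scores), 1)


# Two-component case: Jaccard (Mean) and F-measure (Mean).
lightweight_mean = j_and_f(79.4, 86.0)
stcn_mean = j_and_f(82.6, 89.6)

# Four-component case: Jaccard and F-measure on seen and unseen splits.
lightweight_split = j_and_f(79.5, 75.2, 83.3, 84.0)
stcn_split = j_and_f(82.1, 77.2, 86.4, 85.5)

print(lightweight_mean, stcn_mean, lightweight_split, stcn_split)
```

Under these assumptions, each computed average matches the J&F value reported for the corresponding row, which suggests the listed metrics are internally consistent.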
