HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Kernelized Memory Network for Video Object Segmentation

Hongje Seong Junhyuk Hyun Euntai Kim

Kernelized Memory Network for Video Object Segmentation

Abstract

Semi-supervised video object segmentation (VOS) is a task that involves predicting a target object in a video when the ground truth segmentation mask of the target object is given in the first frame. Recently, space-time memory networks (STM) have received significant attention as a promising solution for semi-supervised VOS. However, an important point is overlooked when applying STM to VOS. The solution (STM) is non-local, but the problem (VOS) is predominantly local. To solve the mismatch between STM and VOS, we propose a kernelized memory network (KMN). Before being trained on real videos, our KMN is pre-trained on static images, as in previous works. Unlike in previous works, we use the Hide-and-Seek strategy in pre-training to obtain the best possible results in handling occlusions and segment boundary extraction. The proposed KMN surpasses the state-of-the-art on standard benchmarks by a significant margin (+5% on DAVIS 2017 test-dev set). In addition, the runtime of KMN is 0.12 seconds per frame on the DAVIS 2016 validation set, and the KMN rarely requires extra computation, when compared with STM.

Code Repositories

hkchengrex/Mask-Propagation
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
semi-supervised-video-object-segmentation-on-1KMN
F-measure (Mean): 80.3
Ju0026F: 77.2
Jaccard (Mean): 74.1
semi-supervised-video-object-segmentation-on-20KMN
D16 val (F): 88.1
D16 val (G): 87.6
D16 val (J): 87.1
D17 val (F): 77.8
D17 val (G): 76.0
D17 val (J): 74.2
FPS: 8.33
video-object-segmentation-on-youtube-vosKMN
F-Measure (Seen): 85.6
F-Measure (Unseen): 83.3
Jaccard (Seen): 81.4
Jaccard (Unseen): 75.3
Overall: 81.4
visual-object-tracking-on-davis-2016KMN
F-measure (Mean): 91.5
Ju0026F: 90.5
Jaccard (Mean): 89.5
visual-object-tracking-on-davis-2017KMN
F-measure (Mean): 85.6
Ju0026F: 82.8
Jaccard (Mean): 80

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Kernelized Memory Network for Video Object Segmentation | Papers | HyperAI