HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Efficient Regional Memory Network for Video Object Segmentation

Xie Haozhe ; Yao Hongxun ; Zhou Shangchen ; Zhang Shengping ; Sun Wenxiu

Efficient Regional Memory Network for Video Object Segmentation

Abstract

Recently, several Space-Time Memory based networks have shown that the objectcues (e.g. video frames as well as the segmented object masks) from the pastframes are useful for segmenting objects in the current frame. However, thesemethods exploit the information from the memory by global-to-global matchingbetween the current and past frames, which lead to mismatching to similarobjects and high computational complexity. To address these problems, wepropose a novel local-to-local matching solution for semi-supervised VOS,namely Regional Memory Network (RMNet). In RMNet, the precise regional memoryis constructed by memorizing local regions where the target objects appear inthe past frames. For the current query frame, the query regions are tracked andpredicted based on the optical flow estimated from the previous frame. Theproposed local-to-local matching effectively alleviates the ambiguity ofsimilar objects in both memory and query frames, which allows the informationto be passed from the regional memory to the query region efficiently andeffectively. Experimental results indicate that the proposed RMNet performsfavorably against state-of-the-art methods on the DAVIS and YouTube-VOSdatasets.

Code Repositories

hzxie/RMNet
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
semi-supervised-video-object-segmentation-on-1RMNet
F-measure (Mean): 78.1
Ju0026F: 75.0
Jaccard (Mean): 71.9
semi-supervised-video-object-segmentation-on-20RMNet
D16 val (F): 82.3
D16 val (G): 81.5
D16 val (J): 80.6
D17 val (F): 77.2
D17 val (G): 75.0
D17 val (J): 72.8
FPS: 11.9
video-object-segmentation-on-youtube-vosRMNet
F-Measure (Seen): 85.7
F-Measure (Unseen): 82.4
Jaccard (Seen): 82.1
Jaccard (Unseen): 75.7
Overall: 81.5
visual-object-tracking-on-davis-2016RMNet
F-measure (Mean): 88.7
Ju0026F: 88.8
Jaccard (Mean): 88.9
visual-object-tracking-on-davis-2017RMNet
F-measure (Mean): 86.0
Ju0026F: 83.5
Jaccard (Mean): 81.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Efficient Regional Memory Network for Video Object Segmentation | Papers | HyperAI