HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation

Liu Daizong ; Xu Shuangjie ; Liu Xiao-Yang ; Xu Zichuan ; Wei Wei ; Zhou Pan

Spatiotemporal Graph Neural Network based Mask Reconstruction for Video
  Object Segmentation

Abstract

This paper addresses the task of segmenting class-agnostic objects insemi-supervised setting. Although previous detection based methods achieverelatively good performance, these approaches extract the best proposal by agreedy strategy, which may lose the local patch details outside the chosencandidate. In this paper, we propose a novel spatiotemporal graph neuralnetwork (STG-Net) to reconstruct more accurate masks for video objectsegmentation, which captures the local contexts by utilizing all proposals. Inthe spatial graph, we treat object proposals of a frame as nodes and representtheir correlations with an edge weight strategy for mask context aggregation.To capture temporal information from previous frames, we use a memory networkto refine the mask of current frame by retrieving historic masks in a temporalgraph. The joint use of both local patch details and temporal relationshipsallow us to better address the challenges such as object occlusion and missing.Without online learning and fine-tuning, our STG-Net achieves state-of-the-artperformance on four large benchmarks (DAVIS, YouTube-VOS, SegTrack-v2, andYouTube-Objects), demonstrating the effectiveness of the proposed approach.

Benchmarks

BenchmarkMethodologyMetrics
semi-supervised-video-object-segmentation-on-20STG-Net
D16 val (F): 86.0
D16 val (G): 85.7
D16 val (J): 85.4
D17 test (F): 66.5
D17 test (G): 63.1
D17 test (J): 59.7
D17 val (F): 77.9
D17 val (G): 74.7
D17 val (J): 71.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation | Papers | HyperAI