HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Semantic Video Segmentation by Gated Recurrent Flow Propagation

David Nilsson; Cristian Sminchisescu

Semantic Video Segmentation by Gated Recurrent Flow Propagation

Abstract

Semantic video segmentation is challenging due to the sheer amount of data that needs to be processed and labeled in order to construct accurate models. In this paper we present a deep, end-to-end trainable methodology to video segmentation that is capable of leveraging information present in unlabeled data in order to improve semantic estimates. Our model combines a convolutional architecture and a spatio-temporal transformer recurrent layer that are able to temporally propagate labeling information by means of optical flow, adaptively gated based on its locally estimated uncertainty. The flow, the recognition and the gated temporal propagation modules can be trained jointly, end-to-end. The temporal, gated recurrent flow propagation component of our model can be plugged into any static semantic segmentation architecture and turn it into a weakly supervised video processing one. Our extensive experiments in the challenging CityScapes and Camvid datasets, and based on multiple deep architectures, indicate that the resulting model can leverage unlabeled temporal frames, next to a labeled one, in order to improve both the video segmentation accuracy and the consistency of its temporal labeling, at no additional annotation cost and with little extra computation.

Benchmarks

BenchmarkMethodologyMetrics
video-semantic-segmentation-on-camvidGRFP
Mean IoU: 67.1
video-semantic-segmentation-on-cityscapes-valGRFP [15]
mIoU: 73.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Semantic Video Segmentation by Gated Recurrent Flow Propagation | Papers | HyperAI