3 months ago

Variational Context-Deformable ConvNets for Indoor Scene Parsing

Qi Wang, Nianhui Guo, Yuan Yuan, Zhitong Xiong

Abstract

Context information is critical for semantic image segmentation. In indoor scenes especially, the large variation in object scale makes spatial context an important factor in segmentation performance. In this paper, we therefore propose a novel variational context-deformable (VCD) module that learns an adaptive receptive field in a structured fashion. Unlike standard ConvNets, which share a fixed-size spatial context across all pixels, the VCD module learns a deformable spatial context under the guidance of depth information, which provides clues for identifying each pixel's real local neighborhood. Specifically, adaptive Gaussian kernels are learned with the guidance of multimodal information. By multiplying the learned Gaussian kernel with standard convolution filters, the VCD module can aggregate a flexible spatial context for each pixel during convolution. The main contributions of this work are as follows: 1) a novel VCD module is proposed, which exploits learnable Gaussian kernels to enable feature learning with a structured adaptive context; 2) variational Bayesian probabilistic modeling is introduced for training the VCD module, which makes the training continuous and more stable; 3) a perspective-aware guidance module is designed to take advantage of multimodal information for RGB-D segmentation. We evaluate the proposed approach on three widely used datasets, and the performance improvements demonstrate its effectiveness.
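
To make the phrase "multiplying the learned Gaussian kernel with standard convolution filters" concrete, the following is a minimal sketch, not the authors' implementation. It assumes a single-channel image, an axis-aligned Gaussian with one hypothetical (sigma_x, sigma_y) pair per pixel, and zero padding. In the paper these parameters would come from a learned, depth-guided module trained variationally; here they are simply passed in, and the function names are illustrative.

```python
import math


def gaussian_kernel(size, sigma_x, sigma_y):
    """Return a size x size grid of Gaussian weights centered on the middle tap."""
    half = size // 2
    return [
        [math.exp(-0.5 * ((dx / sigma_x) ** 2 + (dy / sigma_y) ** 2))
         for dx in range(-half, half + 1)]
        for dy in range(-half, half + 1)
    ]


def modulated_conv2d(image, weights, sigmas):
    """Same-size cross-correlation whose filter is reweighted per pixel.

    image   : H x W nested list of floats
    weights : k x k shared filter, k odd
    sigmas  : H x W nested list of (sigma_x, sigma_y) pairs; a hypothetical
              stand-in for the output of a learned guidance module
    """
    h, w = len(image), len(image[0])
    k = len(weights)
    half = k // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Each pixel gets its own Gaussian, so its effective context differs.
            g = gaussian_kernel(k, *sigmas[y][x])
            acc = 0.0
            for i in range(k):
                for j in range(k):
                    yy, xx = y + i - half, x + j - half
                    if 0 <= yy < h and 0 <= xx < w:  # zero padding
                        acc += weights[i][j] * g[i][j] * image[yy][xx]
            out[y][x] = acc
    return out
```

With a very large sigma the Gaussian is nearly flat and the operation reduces to an ordinary convolution; with a very small sigma only the center tap survives, so the pixel sees almost no context. Intermediate values interpolate between these two extremes.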

Benchmarks

Benchmark | Methodology | Metrics
scene-parsing-on-cityscapes-test | VCD No Coarse | mIoU: 82.3
semantic-segmentation-on-gamus | VCD | mIoU: 59.70
semantic-segmentation-on-nyu-depth-v2 | VCD+RedNet (ResNet-50) | Mean IoU: 50.7%
semantic-segmentation-on-nyu-depth-v2 | VCD+ACNet (ResNet-50) | Mean IoU: 51.9%
semantic-segmentation-on-nyu-depth-v2 | VCD+DeepLab (VGG16) | Mean IoU: 45.3
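
For readers unfamiliar with the metric reported above, mean IoU (mIoU) averages the per-class intersection-over-union between predicted and ground-truth labels. The sketch below is a simplified illustration over flat label lists, not the evaluation code of any of these benchmarks. Each benchmark's official protocol may differ, for example in how ignored labels are handled or in accumulating counts over the whole dataset, and the function name is hypothetical.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target.

    pred, target : equal-length sequences of integer class labels
    """
    inter = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if p == t:
            # A correct pixel adds to both intersection and union of its class.
            inter[p] += 1
            union[p] += 1
        else:
            # A wrong pixel enlarges the union of both classes involved.
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")
```

For instance, with predictions [0, 0, 1, 1] against ground truth [0, 1, 1, 1], class 0 has IoU 1/2 and class 1 has IoU 2/3, giving a mean of 7/12.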
