RedNet: Residual Encoder-Decoder Network for indoor RGB-D Semantic Segmentation

Jindong Jiang; Lunan Zheng; Fei Luo; Zhijun Zhang


Abstract

Indoor semantic segmentation has long been a challenging task in computer vision. In this paper, we propose an RGB-D residual encoder-decoder architecture, named RedNet, for indoor RGB-D semantic segmentation. In RedNet, the residual module is applied to both the encoder and decoder as the basic building block, and skip-connections are used to bypass spatial features between the encoder and decoder. To incorporate the depth information of the scene, a fusion structure is constructed, which performs inference on the RGB image and the depth image separately and fuses their features over several layers. To optimize the network's parameters efficiently, we propose a `pyramid supervision' training scheme, which applies supervised learning over different layers in the decoder to cope with the problem of vanishing gradients. Experimental results show that the proposed RedNet (ResNet-50) achieves a state-of-the-art mIoU accuracy of 47.8% on the SUN RGB-D benchmark dataset.
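The two ideas in the abstract that go beyond a standard encoder-decoder can be sketched compactly: fusing the depth branch's features into the RGB branch, and summing a supervised loss over several intermediate decoder outputs. The NumPy sketch below is a minimal illustration under stated assumptions (element-wise summation for fusion and an unweighted sum of per-stage cross-entropies), not the paper's exact implementation; all names here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Fusion structure (assumption: element-wise summation of branch features) ---
rgb_feat = rng.standard_normal((4, 8))    # hypothetical RGB-encoder features
depth_feat = rng.standard_normal((4, 8))  # hypothetical depth-encoder features
fused = rgb_feat + depth_feat             # fused features passed to the next stage


def cross_entropy(logits, labels):
    """Mean softmax cross-entropy; logits (N, C), labels (N,) int class ids."""
    shifted = logits - logits.max(axis=1, keepdims=True)          # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


def pyramid_loss(side_outputs, labels):
    """Pyramid supervision: sum the loss over every decoder side output,
    so gradients reach early decoder layers directly."""
    return sum(cross_entropy(out, labels) for out in side_outputs)
```

In a real network each side output would be a score map upsampled to the label resolution; summing the stage losses gives every decoder layer a direct gradient path, which is the point of the pyramid supervision scheme.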

Benchmarks

Benchmark                                        Methodology       Metrics
semantic-segmentation-on-nyu-depth-v2            RedNet            Mean IoU: 47.2%
semantic-segmentation-on-sun-rgbd                TokenFusion (Ti)  Mean IoU: 47.8%
semantic-segmentation-on-thud-robotic-dataset    RedNet            mIoU: 76.92
