HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

Harsh Maheshwari Yen-Cheng Liu Zsolt Kira

Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

Abstract

Using multiple spatial modalities has been proven helpful in improving semantic segmentation performance. However, there are several real-world challenges that have yet to be addressed: (a) improving label efficiency and (b) enhancing robustness in realistic scenarios where modalities are missing at the test time. To address these challenges, we first propose a simple yet efficient multi-modal fusion mechanism Linear Fusion, that performs better than the state-of-the-art multi-modal models even with limited supervision. Second, we propose M3L: Multi-modal Teacher for Masked Modality Learning, a semi-supervised framework that not only improves the multi-modal performance but also makes the model robust to the realistic missing modality scenario using unlabeled data. We create the first benchmark for semi-supervised multi-modal semantic segmentation and also report the robustness to missing modalities. Our proposal shows an absolute improvement of up to 10% on robust mIoU above the most competitive baselines. Our code is available at https://github.com/harshm121/M3L

Code Repositories

harshm121/m3l
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
semantic-segmentation-on-stanford2d3d-rgbdLinear Fusion (Segformer B2)
mIoU: 57.16
semantic-segmentation-on-sun-rgbdDFormer-L
Mean IoU (test): 48.17
semi-supervised-semantic-segmentation-on-2dM3L (Linear Fusion B2)
mIoU (0.1% labels): 40.05
mIoU (0.2% labels): 44.62
mIoU (1% labels): 49.28
semi-supervised-semantic-segmentation-on-32M3L (Linear Fusion - Segformer B2)
MM-Robust mIoU (0.1% labels): 41.36
mIoU (0.1% labels): 44.1
semi-supervised-semantic-segmentation-on-32Mean Teacher (Linear Fusion - Segformer B2)
mIoU (0.1% labels): 41.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation | Papers | HyperAI