HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs

Jeongkee Lim Yusung Kim

Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs

Abstract

The challenge of semantic segmentation in Unsupervised Domain Adaptation (UDA) emerges not only from domain shifts between source and target images but also from discrepancies in class taxonomies across domains. Traditional UDA research assumes consistent taxonomy between the source and target domains, thereby limiting their ability to recognize and adapt to the taxonomy of the target domain. This paper introduces a novel approach, Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using Vision Language Models (CSI), which effectively performs domain-adaptive semantic segmentation even in situations of source-target class mismatches. CSI leverages the semantic generalization potential of Visual Language Models (VLMs) to create synergy with previous UDA methods. It leverages segment reasoning obtained through traditional UDA methods, combined with the rich semantic knowledge embedded in VLMs, to relabel new classes in the target domain. This approach allows for effective adaptation to extended taxonomies without requiring any ground truth label for the target domain. Our method has shown to be effective across various benchmarks in situations of inconsistent taxonomy settings (coarse-to-fine taxonomy and open taxonomy) and demonstrates consistent synergy effects when integrated with previous state-of-the-art UDA methods. The implementation is available at http://github.com/jkee58/CSI.

Code Repositories

jkee58/csi
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
unsupervised-domain-adaptation-on-synthia-toDAFormer+CSI
mIoU: 61.4
mIoU (19 classes): 60
unsupervised-domain-adaptation-on-synthia-toMIC+CSI
mIoU: 68.8
mIoU (13 classes): 75.9
mIoU (19 classes): 67.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs | Papers | HyperAI