HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

MasQCLIP for Open-Vocabulary Universal Image Segmentation

{Zhuowen Tu Zheng Ding Tianyi Xiong Xin Xu}

MasQCLIP for Open-Vocabulary Universal Image Segmentation

Abstract

We present a new method for open-vocabulary universal image segmentation, which is capable of performing instance, semantic, and panoptic segmentation under a unified framework. Our approach, called MasQCLIP, seamlessly integrates with a pre-trained CLIP model by utilizing its dense features, thereby circumventing the need for extensive parameter training. MasQCLIP emphasizes two new aspects when building an image segmentation method with a CLIP model: 1) a student-teacher module to deal with masks of the novel (unseen) classes by distilling information from the base (seen) classes; 2) a fine-tuning process to update model parameters for the queries Q within the CLIP model. Thanks to these two simple and intuitive designs, MasQCLIP is able to achieve state-of-the-art performances with a substantial gain over the competing methods by a large margin across all three tasks, including open-vocabulary instance, semantic, and panoptic segmentation. Project page is at https://masqclip.github.io/.

Benchmarks

BenchmarkMethodologyMetrics
panoptic-segmentation-on-ade20kMasQCLIP
PQ: 23.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MasQCLIP for Open-Vocabulary Universal Image Segmentation | Papers | HyperAI