5 months ago

Mean Shift Mask Transformer for Unseen Object Instance Segmentation

Yangxiao Lu; Yuqiao Chen; Nicholas Ruozzi; Yu Xiang

Abstract

Segmenting unseen objects from images is a critical perception skill that a robot needs to acquire. In robot manipulation, it enables a robot to grasp and manipulate unseen objects. Mean shift clustering is a widely used method for image segmentation tasks. However, the traditional mean shift clustering algorithm is not differentiable, which makes it difficult to integrate into an end-to-end neural network training framework. In this work, we propose the Mean Shift Mask Transformer (MSMFormer), a new transformer architecture that simulates the von Mises-Fisher (vMF) mean shift clustering algorithm, allowing for the joint training and inference of both the feature extractor and the clustering. Its central component is a hypersphere attention mechanism, which updates object queries on a hypersphere. To illustrate the effectiveness of our method, we apply MSMFormer to unseen object instance segmentation. Our experiments show that MSMFormer achieves competitive performance compared to state-of-the-art methods for unseen object instance segmentation. The project page, appendix, video, and code are available at https://irvlutd.github.io/MSMFormer
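
For readers unfamiliar with the clustering step the abstract refers to, the following is a minimal sketch of the classical vMF mean shift iteration on unit-norm features, written in plain Python. It is not the authors' implementation and does not show MSMFormer itself. The function names, the concentration value `kappa`, the iteration count, and the toy 2D data are all assumptions made for illustration.

```python
import math


def normalize(v):
    """Scale a vector to unit length so it lies on the hypersphere."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    """Dot product; for unit vectors this is the cosine similarity."""
    return sum(x * y for x, y in zip(a, b))


def vmf_mean_shift(points, seeds, kappa=20.0, num_iters=10):
    """Shift each seed toward a mode of the unit-norm points.

    kappa is the vMF concentration (an assumed value here); it plays
    the role of an inverse bandwidth.
    """
    centers = [list(s) for s in seeds]
    for _ in range(num_iters):
        new_centers = []
        for mu in centers:
            # vMF kernel weight for every point: exp(kappa * cosine similarity)
            weights = [math.exp(kappa * dot(mu, x)) for x in points]
            # Weighted sum of the points, projected back onto the sphere
            shifted = [
                sum(w * x[d] for w, x in zip(weights, points))
                for d in range(len(mu))
            ]
            new_centers.append(normalize(shifted))
        centers = new_centers
    return centers


def assign(points, centers):
    """Label each point with the index of its most similar center."""
    return [
        max(range(len(centers)), key=lambda k: dot(x, centers[k]))
        for x in points
    ]


# Toy data (hypothetical): two groups of directions on the unit circle.
points = [normalize(p) for p in [
    [1.0, 0.1], [1.0, -0.1], [1.0, 0.0],   # group near (1, 0)
    [0.1, 1.0], [-0.1, 1.0], [0.0, 1.0],   # group near (0, 1)
]]
seeds = [points[0], points[3]]

centers = vmf_mean_shift(points, seeds)
labels = assign(points, centers)
print(labels)
```

The kernel weights `exp(kappa * mu . x)`, once divided by their sum, have the same form as a softmax over scaled dot products, which is the shape of a standard attention weight. How MSMFormer turns this into its hypersphere attention is described in the paper and is not reproduced here.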

Code Repositories

youngsean/unseenobjectswithmeanshift (official, PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
unseen-object-instance-segmentation-on-ocid | MSMFormer | Boundary F-measure: 87.3
