OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
Huang Zhening ; Wu Xiaoyang ; Chen Xi ; Zhao Hengshuang ; Zhu Lei ; Lasenby Joan

Abstract
In this work, we introduce OpenIns3D, a new 3D-input-only framework for 3D open-vocabulary scene understanding. The OpenIns3D framework employs a "Mask-Snap-Lookup" scheme. The "Mask" module learns class-agnostic mask proposals in 3D point clouds, the "Snap" module generates synthetic scene-level images at multiple scales and leverages 2D vision-language models to extract interesting objects, and the "Lookup" module searches through the outcomes of "Snap" to assign category names to the proposed masks. This approach, though simple, achieves state-of-the-art performance across a wide range of 3D open-vocabulary tasks, including recognition, object detection, and instance segmentation, on both indoor and outdoor datasets. Moreover, OpenIns3D facilitates effortless switching between different 2D detectors without requiring retraining. When integrated with powerful 2D open-world models, it achieves excellent results in scene understanding tasks. Furthermore, when combined with LLM-powered 2D models, OpenIns3D exhibits an impressive capability to comprehend and process highly complex text queries that demand intricate reasoning and real-world knowledge. Project page: https://zheninghuang.github.io/OpenIns3D/
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-open-vocabulary-instance-segmentation-on | OpenIns3D (3d only) | AP Common: 6.5 AP Head: 16.0 AP Tail: 4.2 AP25: 14.4 AP50: 10.3 mAP: 8.8 |
| 3d-open-vocabulary-instance-segmentation-on | OpenIns3D | AP Common: 14.2 AP Head: 19.2 AP Tail: 14.2 AP25: 23.3 AP50: 20.6 mAP: 15.9 |
| 3d-open-vocabulary-instance-segmentation-on-1 | OpenIns3D | mAP: 15.4 |
| 3d-open-vocabulary-instance-segmentation-on-1 | OpenIns3D (with rgbd) | mAP: 21.1 |
| 3d-open-vocabulary-instance-segmentation-on-2 | OpenIns3D | AP50 Novel B6/N6: 33.0 AP50 Novel B8/N4: 37.0 |
| 3d-open-vocabulary-instance-segmentation-on-3 | OpenIns3D | AP50: 13.3 |