Command Palette
Search for a command to run...
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
Nguyen Phuc D. A. ; Ngo Tuan Duc ; Kalogerakis Evangelos ; Gan Chuang ; Tran Anh ; Pham Cuong ; Nguyen Khoi

Abstract
We introduce Open3DIS, a novel solution designed to tackle the problem ofOpen-Vocabulary Instance Segmentation within 3D scenes. Objects within 3Denvironments exhibit diverse shapes, scales, and colors, making preciseinstance-level identification a challenging task. Recent advancements inOpen-Vocabulary scene understanding have made significant strides in this areaby employing class-agnostic 3D instance proposal networks for objectlocalization and learning queryable features for each 3D mask. While thesemethods produce high-quality instance proposals, they struggle with identifyingsmall-scale and geometrically ambiguous objects. The key idea of our method isa new module that aggregates 2D instance masks across frames and maps them togeometrically coherent point cloud regions as high-quality object proposalsaddressing the above limitations. These are then combined with 3Dclass-agnostic instance proposals to include a wide range of objects in thereal world. To validate our approach, we conducted experiments on threeprominent datasets, including ScanNet200, S3DIS, and Replica, demonstratingsignificant performance gains in segmenting objects with diverse categoriesover the state-of-the-art approaches.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-instance-segmentation-on-scannet200 | Open3DIS (Open-Vocabulary) | mAP: 23.7 |
| 3d-open-vocabulary-instance-segmentation-on | Open3DIS | AP Common: 21.2 AP Head: 27.8 AP Tail: 21.8 AP25: 32.8 AP50: 29.4 mAP: 23.7 |
| 3d-open-vocabulary-instance-segmentation-on-1 | Open3DIS | mAP: 18.1 |
| 3d-open-vocabulary-instance-segmentation-on-2 | Open3DIS | AP50 Base B6/N6: 50.0 AP50 Base B8/N4 : 60.8 AP50 Novel B6/N6: 29.0 AP50 Novel B8/N4: 26.3 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.