HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Maxim Kolodiazhnyi Anna Vorontsova Anton Konushin Danila Rukhovich

OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Abstract

Semantic, instance, and panoptic segmentation of 3D point clouds have been addressed using task-specific models of distinct design. Thereby, the similarity of all segmentation tasks and the implicit relationship between them have not been utilized effectively. This paper presents a unified, simple, and effective model addressing all these tasks jointly. The model, named OneFormer3D, performs instance and semantic segmentation consistently, using a group of learnable kernels, where each kernel is responsible for generating a mask for either an instance or a semantic category. These kernels are trained with a transformer-based decoder with unified instance and semantic queries passed as an input. Such a design enables training a model end-to-end in a single run, so that it achieves top performance on all three segmentation tasks simultaneously. Specifically, our OneFormer3D ranks 1st and sets a new state-of-the-art (+2.1 mAP50) in the ScanNet test leaderboard. We also demonstrate the state-of-the-art results in semantic, instance, and panoptic segmentation of ScanNet (+21 PQ), ScanNet200 (+3.8 mAP50), and S3DIS (+0.8 mIoU) datasets.

Code Repositories

oneformer3d/oneformer3d
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-instance-segmentation-on-s3disOneFormer3D
AP@50: 75.8
mAP: 63.0
mPrec: 82.3
mRec: 74.1
3d-instance-segmentation-on-scannetv2OneFromer3D
mAP: 56.6
mAP @ 50: 80.1
mAP@25: 89.6
3d-object-detection-on-scannetv2OneFormer3D
mAP@0.25: 76.9
mAP@0.5: 65.3
3d-semantic-segmentation-on-s3disOneFormer3D
mIoU (6-Fold): 75.0
mIoU (Area-5): 72.4
3d-semantic-segmentation-on-scannet200OneFormer3D
val mIoU: 30.1
panoptic-segmentation-on-scannetOneFormer3D
PQ: 71.2
PQ_st: 86.1
PQ_th: 69.6
panoptic-segmentation-on-scannetv2OneFormer3D
PQ: 71.2
semantic-segmentation-on-scannetOneFormer3D
val mIoU: 76.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Papers | HyperAI