Command Palette
Search for a command to run...
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model
Han Xu ; Tang Yuan ; Wang Zhaoxuan ; Li Xianzhi

Abstract
Existing Transformer-based models for point cloud analysis suffer fromquadratic complexity, leading to compromised point cloud resolution andinformation loss. In contrast, the newly proposed Mamba model, based on statespace models (SSM), outperforms Transformer in multiple areas with only linearcomplexity. However, the straightforward adoption of Mamba does not achievesatisfactory performance on point cloud tasks. In this work, we presentMamba3D, a state space model tailored for point cloud learning to enhance localfeature extraction, achieving superior performance, high efficiency, andscalability potential. Specifically, we propose a simple yet effective LocalNorm Pooling (LNP) block to extract local geometric features. Additionally, toobtain better global features, we introduce a bidirectional SSM (bi-SSM) withboth a token forward SSM and a novel backward SSM that operates on the featurechannel. Extensive experimental results show that Mamba3D surpassesTransformer-based counterparts and concurrent works in multiple tasks, with orwithout pre-training. Notably, Mamba3D achieves multiple SoTA, including anoverall accuracy of 92.6% (train from scratch) on the ScanObjectNN and 95.1%(with single-modal pre-training) on the ModelNet40 classification task, withonly linear complexity. Our code and weights are available athttps://github.com/xhanxu/Mamba3D.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-point-cloud-classification-on-modelnet40 | Mamba3D + Point-MAE | FLOPs: 3.9G Number of params: 16.9M Overall Accuracy: 95.1 |
| 3d-point-cloud-classification-on-scanobjectnn | Mamba3D (no voting) | FLOPs: 3.9G Number of params: 16.9M OBJ-BG (OA): 92.94 OBJ-ONLY (OA): 92.08 Overall Accuracy: 91.81 |
| 3d-point-cloud-classification-on-scanobjectnn | Mamba3D | FLOPs: 3.9G Number of params: 16.9M OBJ-BG (OA): 94.49 OBJ-ONLY (OA): 92.43 Overall Accuracy: 92.64 |
| supervised-only-3d-point-cloud-classification | Mamba3D | GFLOPs: 3.9 Number of params (M): 16.9 Overall Accuracy (PB_T50_RS): 92.64 |
| supervised-only-3d-point-cloud-classification | Mamba3D (no voting) | GFLOPs: 3.9 Number of params (M): 16.9 Overall Accuracy (PB_T50_RS): 91.81 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.