5 months ago

Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model

Han Xu ; Tang Yuan ; Wang Zhaoxuan ; Li Xianzhi

Abstract

Existing Transformer-based models for point cloud analysis suffer from quadratic complexity, leading to compromised point cloud resolution and information loss. In contrast, the newly proposed Mamba model, based on state space models (SSM), outperforms Transformer in multiple areas with only linear complexity. However, the straightforward adoption of Mamba does not achieve satisfactory performance on point cloud tasks. In this work, we present Mamba3D, a state space model tailored for point cloud learning to enhance local feature extraction, achieving superior performance, high efficiency, and scalability potential. Specifically, we propose a simple yet effective Local Norm Pooling (LNP) block to extract local geometric features. Additionally, to obtain better global features, we introduce a bidirectional SSM (bi-SSM) with both a token forward SSM and a novel backward SSM that operates on the feature channel. Extensive experimental results show that Mamba3D surpasses Transformer-based counterparts and concurrent works in multiple tasks, with or without pre-training. Notably, Mamba3D achieves multiple SoTA, including an overall accuracy of 92.6% (train from scratch) on the ScanObjectNN and 95.1% (with single-modal pre-training) on the ModelNet40 classification task, with only linear complexity. Our code and weights are available at https://github.com/xhanxu/Mamba3D.
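
The linear-complexity claim and the two scan directions named in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: it uses a scalar-state linear recurrence with fixed coefficients (Mamba itself uses input-dependent parameters), and the function names, the reversed scan order over channels, and the summation of the two branches are all assumptions made for illustration only.

```python
# Illustrative sketch only: a toy linear SSM scan, NOT the Mamba3D code.
# All names and the way the two branches are combined are hypothetical.

def ssm_scan(xs, a=0.9, b=1.0, c=1.0):
    """Run h_t = a*h_{t-1} + b*x_t, y_t = c*h_t in one pass over xs.

    The loop visits each element once, so the cost is linear in len(xs),
    unlike pairwise attention, which compares every token with every other.
    """
    h = 0.0
    ys = []
    for x in xs:
        h = a * h + b * x
        ys.append(c * h)
    return ys


def transpose(rows):
    """Swap the token axis and the feature-channel axis of a 2D list."""
    return [list(col) for col in zip(*rows)]


def bidirectional_sketch(tokens, a=0.9):
    """tokens: list of L tokens, each a list of C feature values.

    Forward branch: scan along the token axis, one scan per channel.
    Backward branch (assumed form): scan along the feature-channel axis
    in reverse order, one scan per token.
    The two outputs are summed (an assumption, chosen for simplicity).
    """
    per_channel = transpose(tokens)  # C x L
    fwd = transpose([ssm_scan(ch, a) for ch in per_channel])  # L x C
    bwd = [ssm_scan(tok[::-1], a)[::-1] for tok in tokens]  # L x C
    return [[f + g for f, g in zip(fr, br)] for fr, br in zip(fwd, bwd)]


if __name__ == "__main__":
    # Two tokens with two feature channels each.
    print(bidirectional_sketch([[1.0, 0.0], [0.0, 0.0]], a=0.5))
```

Each branch touches every entry of the L x C feature grid once, so the total work grows linearly with the number of tokens, which is the property the abstract contrasts with quadratic attention.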

Code Repositories

xhanxu/Mamba3D (official, PyTorch; mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
3d-point-cloud-classification-on-modelnet40 | Mamba3D + Point-MAE | FLOPs: 3.9G; Number of params: 16.9M; Overall Accuracy: 95.1
3d-point-cloud-classification-on-scanobjectnn | Mamba3D (no voting) | FLOPs: 3.9G; Number of params: 16.9M; OBJ-BG (OA): 92.94; OBJ-ONLY (OA): 92.08; Overall Accuracy: 91.81
3d-point-cloud-classification-on-scanobjectnn | Mamba3D | FLOPs: 3.9G; Number of params: 16.9M; OBJ-BG (OA): 94.49; OBJ-ONLY (OA): 92.43; Overall Accuracy: 92.64
supervised-only-3d-point-cloud-classification | Mamba3D | GFLOPs: 3.9; Number of params (M): 16.9; Overall Accuracy (PB_T50_RS): 92.64
supervised-only-3d-point-cloud-classification | Mamba3D (no voting) | GFLOPs: 3.9; Number of params (M): 16.9; Overall Accuracy (PB_T50_RS): 91.81
