HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning

Liu Yang ; Chen Chen ; Wang Can ; King Xulin ; Liu Mengyuan

Regress Before Construct: Regress Autoencoder for Point Cloud
  Self-supervised Learning

Abstract

Masked Autoencoders (MAE) have demonstrated promising performance inself-supervised learning for both 2D and 3D computer vision. Nevertheless,existing MAE-based methods still have certain drawbacks. Firstly, thefunctional decoupling between the encoder and decoder is incomplete, whichlimits the encoder's representation learning ability. Secondly, downstreamtasks solely utilize the encoder, failing to fully leverage the knowledgeacquired through the encoder-decoder architecture in the pre-text task. In thispaper, we propose Point Regress AutoEncoder (Point-RAE), a new scheme forregressive autoencoders for point cloud self-supervised learning. The proposedmethod decouples functions between the decoder and the encoder by introducing amask regressor, which predicts the masked patch representation from the visiblepatch representation encoded by the encoder and the decoder reconstructs thetarget from the predicted masked patch representation. By doing so, we minimizethe impact of decoder updates on the representation space of the encoder.Moreover, we introduce an alignment constraint to ensure that therepresentations for masked patches, predicted from the encoded representationsof visible patches, are aligned with the masked patch presentations computedfrom the encoder. To make full use of the knowledge learned in the pre-trainingstage, we design a new finetune mode for the proposed Point-RAE. Extensiveexperiments demonstrate that our approach is efficient during pre-training andgeneralizes well on various downstream tasks. Specifically, our pre-trainedmodels achieve a high accuracy of \textbf{90.28\%} on the ScanObjectNN hardestsplit and \textbf{94.1\%} accuracy on ModelNet40, surpassing all the otherself-supervised learning methods. Our code and pretrained model are publicavailable at: \url{https://github.com/liuyyy111/Point-RAE}.

Code Repositories

liuyyy111/point-rae
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-point-cloud-classification-on-modelnet40Point-RAE
Overall Accuracy: 94.1
3d-point-cloud-classification-on-scanobjectnnPoint-RAE (no voting)
OBJ-BG (OA): 95.53
OBJ-ONLY (OA): 93.63
Overall Accuracy: 90.28
few-shot-3d-point-cloud-classification-on-1Point-RAE
Overall Accuracy: 97.3
Standard Deviation: 1.6
few-shot-3d-point-cloud-classification-on-2Point-RAE
Overall Accuracy: 98.7
Standard Deviation: 1.3
few-shot-3d-point-cloud-classification-on-3Point-RAE
Overall Accuracy: 93.3
Standard Deviation: 4.0
few-shot-3d-point-cloud-classification-on-4Point-RAE
Overall Accuracy: 95.8
Standard Deviation: 3.0

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning | Papers | HyperAI