Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Qi Zekun ; Dong Runpei ; Fan Guofan ; Ge Zheng ; Zhang Xiangyu ; Ma Kaisheng ; Yi Li

Abstract
Mainstream 3D representation learning approaches are built upon contrastive or generative modeling pretext tasks, where great improvements in performance on various downstream tasks have been achieved. However, we find these two paradigms have different characteristics: (i) contrastive models are data-hungry and suffer from a representation over-fitting issue; (ii) generative models have a data filling issue and show inferior data scaling capacity compared to contrastive models. This motivates us to learn 3D representations by sharing the merits of both paradigms, which is non-trivial due to the pattern difference between the two paradigms. In this paper, we propose Contrast with Reconstruct (ReCon), which unifies these two paradigms. ReCon is trained to learn from both generative modeling teachers and single/cross-modal contrastive teachers through ensemble distillation, where the generative student guides the contrastive student. An encoder-decoder style ReCon-block is proposed that transfers knowledge through cross attention with stop-gradient, which avoids pretraining over-fitting and pattern difference issues. ReCon achieves a new state-of-the-art in 3D representation learning, e.g., 91.26% accuracy on ScanObjectNN. Code has been released at https://github.com/qizekun/ReCon.
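The abstract names cross attention with stop-gradient as the mechanism of the ReCon-block but gives no further detail. The following is a minimal PyTorch sketch of that general idea, not the authors' implementation: decoder queries attend to encoder tokens that are detached from the autograd graph, so the loss on the decoder side cannot update the encoder through this path. The class name `CrossAttentionWithStopGrad`, the tensor shapes, the layer normalization, and the residual connection are all assumptions made for illustration.

```python
import torch
import torch.nn as nn


class CrossAttentionWithStopGrad(nn.Module):
    """Hypothetical sketch: decoder queries attend to detached encoder tokens."""

    def __init__(self, dim: int = 32, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm_q = nn.LayerNorm(dim)
        self.norm_kv = nn.LayerNorm(dim)

    def forward(self, queries: torch.Tensor, encoder_tokens: torch.Tensor) -> torch.Tensor:
        # Stop-gradient: detach() cuts the autograd path back to the encoder,
        # so gradients from the decoder-side loss never reach encoder_tokens.
        kv = self.norm_kv(encoder_tokens.detach())
        q = self.norm_q(queries)
        out, _ = self.attn(q, kv, kv)
        # Residual connection on the query stream (an assumption of this sketch).
        return queries + out


def demo():
    torch.manual_seed(0)
    block = CrossAttentionWithStopGrad(dim=32, num_heads=4)
    # Assumed shapes: batch of 2, 16 encoder tokens, 3 decoder queries, width 32.
    encoder_tokens = torch.randn(2, 16, 32, requires_grad=True)
    queries = torch.randn(2, 3, 32, requires_grad=True)
    out = block(queries, encoder_tokens)
    out.sum().backward()
    return out.shape, encoder_tokens.grad, queries.grad
```

Calling `demo()` runs one forward and backward pass. Because of the `detach()` call, `encoder_tokens.grad` stays `None` while `queries.grad` is populated, which is the behavior the stop-gradient is meant to provide in this sketch.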
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-point-cloud-classification-on-modelnet40 | ReCon | Overall Accuracy: 94.7 |
| 3d-point-cloud-classification-on-scanobjectnn | ReCon (no voting) | OBJ-BG (OA): 95.18; OBJ-ONLY (OA): 93.29; Overall Accuracy: 90.63 |
| 3d-point-cloud-classification-on-scanobjectnn | ReCon | OBJ-BG (OA): 95.35; OBJ-ONLY (OA): 93.80; Overall Accuracy: 91.26 |
| 3d-point-cloud-linear-classification-on | ReCon | Overall Accuracy: 93.4 |
| few-shot-3d-point-cloud-classification-on-1 | ReCon | Overall Accuracy: 97.3; Standard Deviation: 1.9 |
| few-shot-3d-point-cloud-classification-on-2 | ReCon | Overall Accuracy: 98.9; Standard Deviation: 1.2 |
| few-shot-3d-point-cloud-classification-on-3 | ReCon | Overall Accuracy: 93.3; Standard Deviation: 3.9 |
| few-shot-3d-point-cloud-classification-on-4 | ReCon | Overall Accuracy: 95.8; Standard Deviation: 3.0 |
| zero-shot-transfer-3d-point-cloud | ReCon | Accuracy (%): 61.7 |
| zero-shot-transfer-3d-point-cloud-1 | ReCon | Accuracy (%): 75.6 |
| zero-shot-transfer-3d-point-cloud-2 | ReCon | OBJ_BG Accuracy (%): 40.4; OBJ_ONLY Accuracy (%): 43.7; PB_T50_RS Accuracy (%): 30.5 |