Command Palette
Search for a command to run...
Self-Supervised Learning for Large-Scale Unsupervised Image Clustering
Evgenii Zheltonozhskii; Chaim Baskin; Alex M. Bronstein; Avi Mendelson

Abstract
Unsupervised learning has always been appealing to machine learning researchers and practitioners, allowing them to avoid an expensive and complicated process of labeling the data. However, unsupervised learning of complex data is challenging, and even the best approaches show much weaker performance than their supervised counterparts. Self-supervised deep learning has become a strong instrument for representation learning in computer vision. However, those methods have not been evaluated in a fully unsupervised setting. In this paper, we propose a simple scheme for unsupervised classification based on self-supervised representations. We evaluate the proposed approach with several recent self-supervised methods showing that it achieves competitive results for ImageNet classification (39% accuracy on ImageNet with 1000 clusters and 46% with overclustering). We suggest adding the unsupervised evaluation to a set of standard benchmarks for self-supervised learning. The code is available at https://github.com/Randl/kmeans_selfsuper
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-classification-on-objectnet | BigBiGAN (RevNet-50 4×) | Top-1 Accuracy: 4.92 |
| unsupervised-image-classification-on | SimCLRv2 ResNet-152 + SK (PCA+k-means, 1500 clusters) | ARI: 1.32±0.05 Accuracy (%): 6.47±0.07 |
| unsupervised-image-classification-on | InfoMin ResNeXt-152 + SK (PCA+k-means) | ARI: 1.59±0.04 Accuracy (%): 6.53±0.19 |
| unsupervised-image-classification-on-imagenet | SimCLRv2 ResNet-152 + SK (PCA+k-means, 1500 clusters) | ARI: 23.94±0.16 Accuracy (%): 46.03±0.21 |
| unsupervised-image-classification-on-imagenet | SimCLRv2 ResNet-152 + SK (PCA+k-means) | ARI: 22.80±0.60 Accuracy (%): 39.07±0.61 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.