Mathilde Caron; Piotr Bojanowski; Armand Joulin; Matthijs Douze

Abstract
Clustering is a class of unsupervised learning methods that has been extensively applied and studied in computer vision. Little work has been done to adapt it to the end-to-end training of visual features on large scale datasets. In this work, we present DeepCluster, a clustering method that jointly learns the parameters of a neural network and the cluster assignments of the resulting features. DeepCluster iteratively groups the features with a standard clustering algorithm, k-means, and uses the subsequent assignments as supervision to update the weights of the network. We apply DeepCluster to the unsupervised training of convolutional neural networks on large datasets like ImageNet and YFCC100M. The resulting model outperforms the current state of the art by a significant margin on all the standard benchmarks.
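The alternation described in the abstract (cluster the current features with k-means, then use the assignments as pseudo-labels to update the network) can be illustrated with a toy sketch. This is not the authors' implementation: the one-layer `tanh` feature map standing in for the convnet, the function names `kmeans` and `deepcluster_toy`, all hyperparameters, and the choice to reinitialize the classifier head after each clustering step are assumptions made for illustration.

```python
import numpy as np

def kmeans(x, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns one cluster index per row of x."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)]
    for _ in range(n_iter):
        # Squared Euclidean distance from every point to every centroid: (n, k)
        dist = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = dist.argmin(axis=1)
        for j in range(k):
            members = x[assign == j]
            if len(members) > 0:  # leave an empty cluster's centroid unchanged
                centroids[j] = members.mean(axis=0)
    return assign

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def deepcluster_toy(data, k=3, feat_dim=8, epochs=5, steps=50, lr=0.1, seed=0):
    """Alternate k-means on features with pseudo-label classification."""
    rng = np.random.default_rng(seed)
    n, d = data.shape
    # Assumption: a single tanh layer stands in for the convolutional network.
    w_feat = rng.normal(scale=0.1, size=(d, feat_dim))
    assign = None
    for _ in range(epochs):
        # Step 1: cluster the current features to obtain pseudo-labels.
        feats = np.tanh(data @ w_feat)
        assign = kmeans(feats, k, seed=seed)
        onehot = np.eye(k)[assign]
        # Assumption: the head is reset because cluster indices are arbitrary
        # and need not match those of the previous clustering.
        w_cls = rng.normal(scale=0.01, size=(feat_dim, k))
        # Step 2: train features and head on the pseudo-labels (cross-entropy).
        for _ in range(steps):
            feats = np.tanh(data @ w_feat)
            probs = softmax(feats @ w_cls)
            g_logits = (probs - onehot) / n          # gradient w.r.t. logits
            g_cls = feats.T @ g_logits
            g_pre = (g_logits @ w_cls.T) * (1.0 - feats ** 2)  # through tanh
            w_cls -= lr * g_cls
            w_feat -= lr * (data.T @ g_pre)
    return w_feat, assign

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Synthetic data: three Gaussian blobs in 4 dimensions.
    data = np.concatenate([rng.normal(loc=c, size=(30, 4)) for c in (-4.0, 0.0, 4.0)])
    w_feat, assign = deepcluster_toy(data, k=3)
    print(w_feat.shape, np.bincount(assign, minlength=3))
```

The sketch only shows the control flow of the alternation on synthetic data; it says nothing about the benchmark numbers reported below, which come from full-scale training.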
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-clustering-on-cifar-10 | DeepCluster | Accuracy: 0.374; ARI: -; NMI: -; Backbone: ResNet-34; Train set: Train+Test |
| image-clustering-on-cifar-100 | DeeperCluster | Accuracy: 0.189; Train set: Train+Test |
| self-supervised-image-classification-on | DeepCluster (AlexNet) | Number of params: 61M; Top-1 accuracy: 41.0 |
| unsupervised-semantic-segmentation-on | MDC | Accuracy: 40.7; mIoU: 7.1 |
| unsupervised-semantic-segmentation-on-6 | MDC (Supervised pretrain) | mIoU (test): 14.3; mIoU (val): 14.6 |