Command Palette
Search for a command to run...
Li Yunfan ; Yang Mouxing ; Peng Dezhong ; Li Taihao ; Huang Jiantao ; Peng Xi

Abstract
This paper proposes to perform online clustering by conducting twincontrastive learning (TCL) at the instance and cluster level. Specifically, wefind that when the data is projected into a feature space with a dimensionalityof the target cluster number, the rows and columns of its feature matrixcorrespond to the instance and cluster representation, respectively. Based onthe observation, for a given dataset, the proposed TCL first constructspositive and negative pairs through data augmentations. Thereafter, in the rowand column space of the feature matrix, instance- and cluster-level contrastivelearning are respectively conducted by pulling together positive pairs whilepushing apart the negatives. To alleviate the influence of intrinsicfalse-negative pairs and rectify cluster assignments, we adopt aconfidence-based criterion to select pseudo-labels for boosting both theinstance- and cluster-level contrastive learning. As a result, the clusteringperformance is further improved. Besides the elegant idea of twin contrastivelearning, another advantage of TCL is that it could independently predict thecluster assignment for each instance, thus effortlessly fitting onlinescenarios. Extensive experiments on six widely-used image and text benchmarksdemonstrate the effectiveness of TCL. The code will be released on GitHub.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-clustering-on-cifar-10 | TCL | ARI: 0.780 Accuracy: 0.887 Backbone: ResNet-34 NMI: 0.819 Train set: Train |
| image-clustering-on-cifar-100 | TCL | ARI: 0.357 Accuracy: 0.531 NMI: 0.529 Train Set: Train |
| image-clustering-on-imagenet-10 | TCL | ARI: 0.837 Accuracy: 0.895 NMI: 0.875 |
| image-clustering-on-imagenet-dog-15 | TCL | ARI: 0.516 Accuracy: 0.644 NMI: 0.623 |
| image-clustering-on-stl-10 | TCL | ARI: 0.757 Accuracy: 0.868 Backbone: ResNet-34 NMI: 0.799 Train Split: Train |
| short-text-clustering-on-biomedical | TCL | Acc: 49.8 NMI: 42.9 |
| short-text-clustering-on-stackoverflow | TCL | Acc: 88.2 NMI: 0.786 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.