HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Stable Cluster Discrimination for Deep Clustering

Qi Qian

Stable Cluster Discrimination for Deep Clustering

Abstract

Deep clustering can optimize representations of instances (i.e., representation learning) and explore the inherent data distribution (i.e., clustering) simultaneously, which demonstrates a superior performance over conventional clustering methods with given features. However, the coupled objective implies a trivial solution that all instances collapse to the uniform features. To tackle the challenge, a two-stage training strategy is developed for decoupling, where it introduces an additional pre-training stage for representation learning and then fine-tunes the obtained model for clustering. Meanwhile, one-stage methods are developed mainly for representation learning rather than clustering, where various constraints for cluster assignments are designed to avoid collapsing explicitly. Despite the success of these methods, an appropriate learning objective tailored for deep clustering has not been investigated sufficiently. In this work, we first show that the prevalent discrimination task in supervised learning is unstable for one-stage clustering due to the lack of ground-truth labels and positive instances for certain clusters in each mini-batch. To mitigate the issue, a novel stable cluster discrimination (SeCu) task is proposed and a new hardness-aware clustering criterion can be obtained accordingly. Moreover, a global entropy constraint for cluster assignments is studied with efficient optimization. Extensive experiments are conducted on benchmark data sets and ImageNet. SeCu achieves state-of-the-art performance on all of them, which demonstrates the effectiveness of one-stage deep clustering. Code is available at \url{https://github.com/idstcv/SeCu}.

Code Repositories

idstcv/secu
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
image-clustering-on-cifar-10SeCu
ARI: 0.857
Accuracy: 0.93
Backbone: ResNet-18
NMI: 0.861
Train set: Train
image-clustering-on-imagenetSeCu
ARI: 41.9
Accuracy: 53.5
NMI: 79.4
image-clustering-on-imagenetCoKe
ARI: 35.6
Accuracy: 47.6
NMI: 76.2
image-clustering-on-stl-10SeCu
ARI: 0.693
Accuracy: 0.836
Backbone: ResNet-18
NMI: 0.733
Train Split: Train
unsupervised-image-classification-on-cifar-10SeCu
Accuracy: 93
unsupervised-image-classification-on-cifar-20SeCu
Accuracy: 55.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Stable Cluster Discrimination for Deep Clustering | Papers | HyperAI