5 months ago

Stable Cluster Discrimination for Deep Clustering

Qi Qian

Abstract

Deep clustering can optimize representations of instances (i.e., representation learning) and explore the inherent data distribution (i.e., clustering) simultaneously, which demonstrates a superior performance over conventional clustering methods with given features. However, the coupled objective implies a trivial solution that all instances collapse to the uniform features. To tackle the challenge, a two-stage training strategy is developed for decoupling, where it introduces an additional pre-training stage for representation learning and then fine-tunes the obtained model for clustering. Meanwhile, one-stage methods are developed mainly for representation learning rather than clustering, where various constraints for cluster assignments are designed to avoid collapsing explicitly. Despite the success of these methods, an appropriate learning objective tailored for deep clustering has not been investigated sufficiently. In this work, we first show that the prevalent discrimination task in supervised learning is unstable for one-stage clustering due to the lack of ground-truth labels and positive instances for certain clusters in each mini-batch. To mitigate the issue, a novel stable cluster discrimination (SeCu) task is proposed and a new hardness-aware clustering criterion can be obtained accordingly. Moreover, a global entropy constraint for cluster assignments is studied with efficient optimization. Extensive experiments are conducted on benchmark data sets and ImageNet. SeCu achieves state-of-the-art performance on all of them, which demonstrates the effectiveness of one-stage deep clustering. Code is available at \url{https://github.com/idstcv/SeCu}.

Code Repositories

idstcv/secu

Official

pytorch

Benchmarks

Benchmark	Methodology	Metrics
image-clustering-on-cifar-10	SeCu	ARI: 0.857 Accuracy: 0.93 Backbone: ResNet-18 NMI: 0.861 Train set: Train
image-clustering-on-imagenet	SeCu	ARI: 41.9 Accuracy: 53.5 NMI: 79.4
image-clustering-on-imagenet	CoKe	ARI: 35.6 Accuracy: 47.6 NMI: 76.2
image-clustering-on-stl-10	SeCu	ARI: 0.693 Accuracy: 0.836 Backbone: ResNet-18 NMI: 0.733 Train Split: Train
unsupervised-image-classification-on-cifar-10	SeCu	Accuracy: 93
unsupervised-image-classification-on-cifar-20	SeCu	Accuracy: 55.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette