HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

C3: Cross-instance guided Contrastive Clustering

Mohammadreza Sadeghi Hadi Hojjati Narges Armanfard

C3: Cross-instance guided Contrastive Clustering

Abstract

Clustering is the task of gathering similar data samples into clusters without using any predefined labels. It has been widely studied in machine learning literature, and recent advancements in deep learning have revived interest in this field. Contrastive clustering (CC) models are a staple of deep clustering in which positive and negative pairs of each data instance are generated through data augmentation. CC models aim to learn a feature space where instance-level and cluster-level representations of positive pairs are grouped together. Despite improving the SOTA, these algorithms ignore the cross-instance patterns, which carry essential information for improving clustering performance. This increases the false-negative-pair rate of the model while decreasing its true-positive-pair rate. In this paper, we propose a novel contrastive clustering method, Cross-instance guided Contrastive Clustering (C3), that considers the cross-sample relationships to increase the number of positive pairs and mitigate the impact of false negative, noise, and anomaly sample on the learned representation of data. In particular, we define a new loss function that identifies similar instances using the instance-level representation and encourages them to aggregate together. Moreover, we propose a novel weighting method to select negative samples in a more efficient way. Extensive experimental evaluations show that our proposed method can outperform state-of-the-art algorithms on benchmark computer vision datasets: we improve the clustering accuracy by 6.6%, 3.3%, 5.0%, 1.3% and 0.3% on CIFAR-10, CIFAR-100, ImageNet-10, ImageNet-Dogs, and Tiny-ImageNet.

Code Repositories

Armanfard-Lab/C3
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-clustering-on-cifar-10C3
ARI: 0.707
Accuracy: 0.838
NMI: 0.748
image-clustering-on-cifar-100C3
ARI: 0.275
Accuracy: 0.451
NMI: 0.434
image-clustering-on-imagenet-10C3
ARI: 0.861
Accuracy: 0.942
NMI: 0.905
image-clustering-on-imagenet-dog-15C3
ARI: 0.28
Accuracy: 0.434
NMI: 0.448
image-clustering-on-tiny-imagenetC3
ARI: 0.065
Accuracy: 0.141
NMI: 0.335

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
C3: Cross-instance guided Contrastive Clustering | Papers | HyperAI