HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Improving k-Means Clustering Performance with Disentangled Internal Representations

Abien Fred Agarap Arnulfo P. Azcarraga

Improving k-Means Clustering Performance with Disentangled Internal Representations

Abstract

Deep clustering algorithms combine representation learning and clustering by jointly optimizing a clustering loss and a non-clustering loss. In such methods, a deep neural network is used for representation learning together with a clustering network. Instead of following this framework to improve clustering performance, we propose a simpler approach of optimizing the entanglement of the learned latent code representation of an autoencoder. We define entanglement as how close pairs of points from the same class or structure are, relative to pairs of points from different classes or structures. To measure the entanglement of data points, we use the soft nearest neighbor loss, and expand it by introducing an annealing temperature factor. Using our proposed approach, the test clustering accuracy was 96.2% on the MNIST dataset, 85.6% on the Fashion-MNIST dataset, and 79.2% on the EMNIST Balanced dataset, outperforming our baseline models.

Code Repositories

https://gitlab.com/afagarap/pt-snnl
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-classification-on-emnist-balancedSNNL-5
Accuracy: 78.5
NMI: 77.6
image-classification-on-fashion-mnistSNNL-5
Accuracy: 84.4
NMI: 76.2
image-classification-on-mnistSNNL-5
Accuracy: 95.5
image-clustering-on-emnist-balancedSNNL-8
Accuracy: 0.35
NMI: 0.442
image-clustering-on-emnist-balancedAE+SNNL
Accuracy: 0.792
NMI: 0.783
image-clustering-on-fashion-mnistSNNL-4
Accuracy: 0.555
NMI: 0.574
image-clustering-on-mnistSNNL-7
NMI: 0.891
image-clustering-on-mnist-testAE+SNNL
Accuracy: 0.962
NMI: 0.903

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Improving k-Means Clustering Performance with Disentangled Internal Representations | Papers | HyperAI