HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Git: Clustering Based on Graph of Intensity Topology

Zhangyang Gao; Haitao Lin; Cheng Tan; Lirong Wu; Stan. Z Li

Git: Clustering Based on Graph of Intensity Topology

Abstract

\textbf{A}ccuracy, \textbf{R}obustness to noises and scales, \textbf{I}nterpretability, \textbf{S}peed, and \textbf{E}asy to use (ARISE) are crucial requirements of a good clustering algorithm. However, achieving these goals simultaneously is challenging, and most advanced approaches only focus on parts of them. Towards an overall consideration of these aspects, we propose a novel clustering algorithm, namely GIT (Clustering Based on \textbf{G}raph of \textbf{I}ntensity \textbf{T}opology). GIT considers both local and global data structures: firstly forming local clusters based on intensity peaks of samples, and then estimating the global topological graph (topo-graph) between these local clusters. We use the Wasserstein Distance between the predicted and prior class proportions to automatically cut noisy edges in the topo-graph and merge connected local clusters as final clusters. Then, we compare GIT with seven competing algorithms on five synthetic datasets and nine real-world datasets. With fast local cluster detection, robust topo-graph construction and accurate edge-cutting, GIT shows attractive ARISE performance and significantly exceeds other non-convex clustering methods. For example, GIT outperforms its counterparts about $10\%$ (F1-score) on MNIST and FashionMNIST. Code is available at \color{red}{https://github.com/gaozhangyang/GIT}.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
clustering-algorithms-evaluation-on-fashion-2QuickShiftPP
ARI: 16%
F1-score: 42%
NMI: 41%
clustering-algorithms-evaluation-on-fashion-2SpectACI
ARI: 29%
F1-score: 47%
NMI: 45%
clustering-algorithms-evaluation-on-fashion-2Spectral Clustering
ARI: 34%
F1-score: 43%
NMI: 49%
clustering-algorithms-evaluation-on-fashion-2AE+GIT
ARI: 49%
F1-score: 65%
NMI: 61%
clustering-algorithms-evaluation-on-fashion-2k-Means++
ARI: 35%
F1-score: 39%
NMI: 51%
clustering-algorithms-evaluation-on-fashion-2GIT
ARI: 32%
F1-score: 56%
NMI: 51%
clustering-algorithms-evaluation-on-mnistk-Means++
ARI: 36%
F1-score: 50%
NMI: 45%
clustering-algorithms-evaluation-on-mnistGIT
ARI: 42%
F1-score: 59%
NMI: 53%
clustering-algorithms-evaluation-on-mnistSpectACI
ARI: 17%
F1-score: 40%
NMI: 33%
clustering-algorithms-evaluation-on-mnistQuickShiftPP
ARI: 13%
F1-score: 45%
NMI: 45%
clustering-algorithms-evaluation-on-mnistSpectral Clustering
ARI: 33%
F1-score: 41%
NMI: 44%
clustering-algorithms-evaluation-on-mnistAE+GIT
ARI: 77%
F1-score: 88%
NMI: 81%
clustering-algorithms-evaluation-on-olivettiSpectral Clustering
ARI: 19%
F1-score: 37%
NMI: 66%
clustering-algorithms-evaluation-on-olivettik-Means++
ARI: 38%
F1-score: 52%
NMI: 74%
clustering-algorithms-evaluation-on-olivettiGIT
ARI: 45%
F1-score: 62%
NMI: 78%
clustering-algorithms-evaluation-on-olivettiSpectACI
ARI: 21%
F1-score: 34%
NMI: 61%
clustering-algorithms-evaluation-on-olivettiQuickShiftPP
ARI: 38%
F1-score: 60%
NMI: 79%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Git: Clustering Based on Graph of Intensity Topology | Papers | HyperAI