HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Incremental Constrained Clustering by Minimal Weighted Modification

{Christel Vrain Samir Loudni Thi-Bich-Hanh Dao Aymeric Beauchamp}

Incremental Constrained Clustering by Minimal Weighted Modification

Abstract

Clustering is a well-known task in Data Mining that aims at grouping data instances according to their similarity. It is an exploratory and unsupervised task whose results depend on many parameters, often requiring the expert to iterate several times before satisfaction. Constrained clustering has been introduced for better modeling the expectations of the expert. Nevertheless constrained clustering is not yet sufficient since it usually requires the constraints to be given before the clustering process. In this paper we address a more general problem that aims at modeling the exploratory clustering process, through a sequence of clustering modifications where expert constraints are added on the fly. We present an incremental constrained clustering framework integrating active query strategies and a Constraint Programming model to fit the expert expectations while preserving the stability of the partition, so that the expert can understand the process and apprehend its impact. Our model supports instance and group-level constraints, which can be relaxed. Experiments on reference datasets and a case study related to the analysis of satellite image time series show the relevance of our framework.

Benchmarks

BenchmarkMethodologyMetrics
incremental-constrained-clustering-on-irisCOP-KMeans+Random
AUBC-ARI (quality): 0.712±0.012
AUBC-ARI (similarity): 0.309±0.004
incremental-constrained-clustering-on-irisMPCK-Means+Random
AUBC-ARI (quality): 0.783±0.025
AUBC-ARI (similarity): 0.472±0.024
incremental-constrained-clustering-on-irisPCK-Means+Random
AUBC-ARI (quality): 0.695±0.018
AUBC-ARI (similarity): 0.271±0.008
incremental-constrained-clustering-on-irisIAC+Random
AUBC-ARI (quality): 0.816±0.014
AUBC-ARI (similarity): 0.605±0.016
incremental-constrained-clustering-on-irisPCK-Means+NPU
AUBC-ARI (quality): 0.876±0.018
AUBC-ARI (similarity): 0.398±0.029
incremental-constrained-clustering-on-irisIAC+NPU
AUBC-ARI (quality): 0.941±0.007
AUBC-ARI (similarity): 0.668±0.02
incremental-constrained-clustering-on-irisCOP-KMeans+NPU
AUBC-ARI (quality): 0.88±0.016
AUBC-ARI (similarity): 0.432±0.029
incremental-constrained-clustering-on-irisMPCK-Means+NPU
AUBC-ARI (quality): 0.928±0.015
AUBC-ARI (similarity): 0.584±0.027
incremental-constrained-clustering-on-winePCK-Means+NPU
AUBC-ARI (quality): 0.472±0.017
AUBC-ARI (similarity): 0.337±0.011
incremental-constrained-clustering-on-wineMPCK-Means+NPU
AUBC-ARI (quality): 0.893±0.016
AUBC-ARI (similarity): 0.817±0.002
incremental-constrained-clustering-on-wineCOP-KMeans+Random
AUBC-ARI (quality): 0.369±0.003
AUBC-ARI (similarity): 0.241±0.008
incremental-constrained-clustering-on-wineIAC+Random
AUBC-ARI (quality): 0.349±0.01
AUBC-ARI (similarity): 0.441±0.012
incremental-constrained-clustering-on-winePCK-Means+Random
AUBC-ARI (quality): 0.371±0.003
AUBC-ARI (similarity): 0.332±0.01
incremental-constrained-clustering-on-wineMPCK-Means+Random
AUBC-ARI (quality): 0.821±0.005
AUBC-ARI (similarity): 0.845±0.006
incremental-constrained-clustering-on-wineIAC+NPU
AUBC-ARI (quality): 0.481±0.016
AUBC-ARI (similarity): 0.455±0.09
incremental-constrained-clustering-on-wineCOP-KMeans+NPU
AUBC-ARI (quality): 0.469±0.019
AUBC-ARI (similarity): 0.340±0.001

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Incremental Constrained Clustering by Minimal Weighted Modification | Papers | HyperAI