On Clustering and Interpreting with Rules by Means of Mathematical Optimization

Emilio Carrizosa, Kseniia Kurishchenko*, Alfredo Marín, Dolores Romero Morales

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

160 Downloads (Pure)

Abstract

In this paper, we make Cluster Analysis more interpretable with a new approach that simultaneously allocates individuals to clusters and gives rule-based explanations to each cluster. The traditional homogeneity metric in clustering, namely the sum of the dissimilarities between individuals in the same cluster, is enriched by considering also, for each cluster and its associated explanation, two explainability criteria, namely, the accuracy of the explanation, i.e., how many individuals within the cluster satisfy its explanation, and the distinctiveness of the explanation, i.e., how many individuals outside the cluster satisfy its explanation. Finding the clusters and the explanations optimizing a joint measure of homogeneity, accuracy, and distinctiveness is formulated as a multi-objective Mixed Integer Linear Optimization problem, from which non-dominated solutions are generated. Our approach is tested on real-world datasets.
OriginalsprogEngelsk
Artikelnummer106180
TidsskriftComputers & Operations Research
Vol/bind154
Antal sider19
ISSN0305-0548
DOI
StatusUdgivet - jun. 2023

Emneord

  • Machine learning
  • Interpretability
  • Cluster analysis
  • Rules
  • Mixed-Integer Programming

Citationsformater