Title | ||
---|---|---|
Beyond classical consensus clustering: The least squares approach to multiple solutions |
Abstract | ||
---|---|---|
Clustering is one of the most important unsupervised learning problems and it consists of finding a common structure in a collection of unlabeled data. However, due to the ill-posed nature of the problem, different runs of the same clustering algorithm applied to the same data-set usually produce different solutions. In this scenario choosing a single solution is quite arbitrary. On the other hand, in many applications the problem of multiple solutions becomes intractable, hence it is often more desirable to provide a limited group of ''good'' clusterings rather than a single solution. In the present paper we propose the least squares consensus clustering. This technique allows to extrapolate a small number of different clustering solutions from an initial (large) ensemble obtained by applying any clustering algorithm to a given data-set. We also define a measure of quality and present a graphical visualization of each consensus clustering to make immediately interpretable the strength of the consensus. We have carried out several numerical experiments both on synthetic and real data-sets to illustrate the proposed methodology. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1016/j.patrec.2011.05.003 | Pattern Recognition Letters |
Keywords | Field | DocType |
multiple solution,different run,different solution,common structure,different clustering solution,classical consensus clustering,important unsupervised learning problem,present paper,data visualization,least-squares consensus,single solution,clustering,clustering algorithm,squares consensus clustering,least square,unsupervised learning | Fuzzy clustering,Canopy clustering algorithm,Data mining,CURE data clustering algorithm,Pattern recognition,Correlation clustering,Consensus clustering,Artificial intelligence,Constrained clustering,Cluster analysis,Mathematics,Single-linkage clustering | Journal |
Volume | Issue | ISSN |
32 | 12 | Pattern Recognition Letters |
Citations | PageRank | References |
5 | 0.43 | 17 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Loredana Murino | 1 | 20 | 2.91 |
Claudia Angelini | 2 | 103 | 8.70 |
I. De Feis | 3 | 27 | 4.42 |
Giancarlo Raiconi | 4 | 118 | 15.08 |
R. Tagliaferri | 5 | 128 | 12.91 |