Title
Clustering aggregation
Abstract
We consider the following problem: given a set of clusterings, find a single clustering that agrees as much as possible with the input clusterings. This problem, clustering aggregation, appears naturally in various contexts. For example, clustering categorical data is an instance of the clustering aggregation problem; each categorical attribute can be viewed as a clustering of the input rows where rows are grouped together if they take the same value on that attribute. Clustering aggregation can also be used as a metaclustering method to improve the robustness of clustering by combining the output of multiple algorithms. Furthermore, the problem formulation does not require a priori information about the number of clusters; it is naturally determined by the optimization function. In this article, we give a formal statement of the clustering aggregation problem, and we propose a number of algorithms. Our algorithms make use of the connection between clustering aggregation and the problem of correlation clustering. Although the problems we consider are NP-hard, for several of our methods, we provide theoretical guarantees on the quality of the solutions. Our work provides the best deterministic approximation algorithm for the variation of the correlation clustering problem we consider. We also show how sampling can be used to scale the algorithms for large datasets. We give an extensive empirical evaluation demonstrating the usefulness of the problem and of the solutions.
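The two ideas in the abstract, treating each categorical attribute as a clustering of the rows and looking for a clustering that agrees with the inputs, can be illustrated with a small sketch. The abstract does not say how agreement is measured; the sketch assumes it is the number of object pairs that one clustering puts together and the other puts apart. The function names and toy rows are hypothetical, and the selection step is a naive baseline that only chooses among the input clusterings, not the algorithms proposed in the article.

```python
from itertools import combinations


def attribute_to_clustering(rows, attr_index):
    """View one categorical attribute as a clustering: rows sharing a value share a cluster."""
    return [row[attr_index] for row in rows]


def disagreement(c1, c2):
    """Count object pairs that one clustering groups together and the other separates.

    Assumption: this pair-counting measure stands in for the notion of
    agreement, which the abstract does not define.
    """
    count = 0
    for i, j in combinations(range(len(c1)), 2):
        if (c1[i] == c1[j]) != (c2[i] == c2[j]):
            count += 1
    return count


def total_disagreement(candidate, clusterings):
    """Sum of disagreements between a candidate and every input clustering."""
    return sum(disagreement(candidate, c) for c in clusterings)


def best_input_clustering(clusterings):
    """Naive baseline: return the input clustering with the least total disagreement."""
    return min(clusterings, key=lambda c: total_disagreement(c, clusterings))


# Hypothetical toy data: four rows, three categorical attributes.
rows = [
    ("red", "small", "round"),
    ("red", "small", "square"),
    ("blue", "large", "round"),
    ("blue", "large", "square"),
]

clusterings = [attribute_to_clustering(rows, k) for k in range(3)]
best = best_input_clustering(clusterings)
print(best)
```

In this toy case the first two attributes induce the same partition of the rows, so either of them minimizes the total disagreement, and the result has two clusters without the number of clusters being specified in advance.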
Year
2007
DOI
10.1145/1217299.1217303
Venue
TKDD
Keywords
categorical attribute, following problem, input clusterings, input row, single clustering, categorical data, correlation clustering, problem formulation, data clustering, clustering categorical data, clustering aggregation, clustering aggregation problem
DocType
Journal
Volume
1
Issue
1
Citations
109
PageRank
3.11
References
23
Authors
3
Name                  Order  Citations  PageRank
Aristides Gionis      1      6808       386.81
Heikki Mannila        2      6595       1495.69
Panayiotis Tsaparas   3      1286       72.59