Abstract | ||
---|---|---|
The Correlation Clustering problem has been introduced recently [N. Bansal, A. Blum, S. Chawla, Correlation Clustering, in: Proc. 43rd Symp. Foundations of Computer Science, FOCS, 2002, pp. 238-247] as a model for clustering data when a binary relationship between data points is known. More precisely, for each pair of points we have two scores measuring the similarity and dissimilarity respectively, of the two points, and we would like to compute an optimal partition where the value of a partition is obtained by summing up the similarity scores of pairs involving points from the same cluster and the dissimilarity scores of pairs involving points from different clusters. A closely related problem is Consensus Clustering, where we are given a set of partitions and we would like to obtain a partition that best summarizes the input partitions. The latter problem is a restricted case of Correlation Clustering. In this paper we prove that Minimum Consensus Clustering is APX-hard even for three input partitions, answering an open question in the literature, while Maximum Consensus Clustering admits a PTAS. We exhibit a combinatorial and practical 45-approximation algorithm based on a greedy technique for Maximum Consensus Clustering on three partitions. Moreover, we prove that a PTAS exists for Maximum Correlation Clustering when the maximum ratio between two scores is at most a constant. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1016/j.jcss.2007.06.024 | J. Comput. Syst. Sci. |
Keywords | Field | DocType |
optimal partition,correlation clustering,minimum consensus clustering,correlation clustering problem,apx-hardness,latter problem,maximum correlation clustering,input partition,consensus clustering,related problem,approximation,ptas,maximum consensus clustering | k-medians clustering,Fuzzy clustering,Discrete mathematics,CURE data clustering algorithm,Combinatorics,Complete-linkage clustering,Correlation clustering,Consensus clustering,Cluster analysis,Mathematics,Single-linkage clustering | Journal |
Volume | Issue | ISSN |
74 | 5 | Journal of Computer and System Sciences |
Citations | PageRank | References |
16 | 0.71 | 15 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Paola Bonizzoni | 1 | 502 | 52.23 |
Gianluca Della Vedova | 2 | 342 | 36.39 |
Riccardo Dondi | 3 | 16 | 0.71 |
Tao Jiang | 4 | 1809 | 155.32 |