Title | ||
---|---|---|
Restoring: A Greedy Heuristic Approach Based on Neighborhood for Correlation Clustering. |
Abstract | ||
---|---|---|
Correlation Clustering has received considerable attention in the machine learning literature because it does not require the number of clusters to be specified in advance. Many approximation algorithms for Correlation Clustering have been proposed with worst-case theoretical guarantees, but with little experimental evaluation. These methods consider only the direct associations between vertices and perform poorly on real datasets. In this paper, we propose a neighborhood-based method called Restoring, in which we argue that the neighborhood around two connected vertices is important and that two vertices belonging to the same cluster should have the same neighborhood. Our algorithm iteratively chooses two connected vertices and restores their neighborhood. We also define the cost of keeping or removing one non-common neighbor and identify a restoring order based on neighborhood similarity. Experiments conducted on five sub-datasets of Cora show that our method outperforms existing well-known methods in both result quality and objective value. © Springer-Verlag 2013. |
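
The abstract refers to an objective value and a neighborhood similarity without defining either. As a rough illustration only, the sketch below computes the standard disagreement count for Correlation Clustering and a Jaccard similarity over closed neighborhoods. The function names, the treatment of unlisted pairs as negative, and the choice of Jaccard are assumptions made for this example; they are not taken from the paper and do not reproduce the Restoring algorithm.

```python
from itertools import combinations


def disagreements(vertices, positive_edges, clusters):
    """Count disagreements of a clustering on a signed graph.

    Assumption: every pair not listed in positive_edges is a negative pair.
    A disagreement is a positive pair split across clusters, or a negative
    pair placed inside the same cluster.
    """
    label = {v: i for i, cluster in enumerate(clusters) for v in cluster}
    positive = {frozenset(edge) for edge in positive_edges}
    cost = 0
    for u, v in combinations(vertices, 2):
        same_cluster = label[u] == label[v]
        is_positive = frozenset((u, v)) in positive
        if is_positive != same_cluster:
            cost += 1
    return cost


def neighborhood_similarity(u, v, positive_edges):
    """Jaccard similarity of closed neighborhoods (an illustrative choice)."""

    def closed_neighborhood(x):
        # The vertex itself plus every vertex joined to it by a positive edge.
        neighbors = {x}
        for a, b in positive_edges:
            if a == x:
                neighbors.add(b)
            elif b == x:
                neighbors.add(a)
        return neighbors

    nu, nv = closed_neighborhood(u), closed_neighborhood(v)
    return len(nu & nv) / len(nu | nv)


# Hypothetical toy graph: a path a - b - c - d of positive edges.
vertices = ["a", "b", "c", "d"]
positive_edges = [("a", "b"), ("b", "c"), ("c", "d")]
cost_three_one = disagreements(vertices, positive_edges, [{"a", "b", "c"}, {"d"}])
cost_two_two = disagreements(vertices, positive_edges, [{"a", "b"}, {"c", "d"}])
```

On this toy path, splitting into two pairs breaks only the positive edge between `b` and `c`, while grouping `a`, `b`, `c` together places the negative pair (`a`, `c`) in one cluster and cuts the positive edge to `d`.
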
Year | DOI | Venue |
---|---|---|
2013 | 10.1007/978-3-642-53914-5_30 | ADMA (1) |
Keywords | Field | DocType |
correlation clustering, entity resolution, neighborhood similarity | Approximation algorithm, Cluster (physics), Data mining, Name resolution, Vertex (geometry), Correlation clustering, Computer science, Theoretical computer science, Greedy algorithm, Greedy randomized adaptive search procedure | Conference |
Volume | Issue | ISSN |
8346 LNAI | PART 1 | 1611-3349 |
Citations | PageRank | References |
0 | 0.34 | 7 |
Authors | ||
2 |