Abstract | ||
---|---|---|
Clustering is one of the most versatile tools for data analysis. In the recent years, clustering that seeks the continuity of data (in opposition to classical centroid-based approaches) has attracted an increasing research interest. It is a challenging problem with a remarkable practical interest. The most popular continuity clustering method is the spectral clustering (SC) algorithm, which is based on graph cut: It initially generates a similarity graph using a distance measure and then studies its graph spectrum to find the best cut. This approach is sensitive to the parameters of the metric, and a correct parameter choice is critical to the quality of the cluster. This work proposes a new algorithm, inspired by SC, that reduces the parameter dependency while maintaining the quality of the solution. The new algorithm, named genetic graph-based clustering (GGC), takes an evolutionary approach introducing a genetic algorithm (GA) to cluster the similarity graph. The experimental validation shows that GGC increases robustness of SC and has competitive performance in comparison with classical clustering methods, at least, in the synthetic and real dataset used in the experiments. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1142/S0129065714300083 | INTERNATIONAL JOURNAL OF NEURAL SYSTEMS |
Keywords | Field | DocType |
Machine learning, clustering, spectral clustering, graph clustering, genetic algorithms | Fuzzy clustering,Data mining,CURE data clustering algorithm,Computer science,Artificial intelligence,Cluster analysis,Clustering coefficient,Single-linkage clustering,k-medians clustering,Canopy clustering algorithm,Pattern recognition,Correlation clustering,Machine learning | Journal |
Volume | Issue | ISSN |
24 | 3 | 0129-0657 |
Citations | PageRank | References |
39 | 1.16 | 37 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Héctor Menéndez | 1 | 171 | 15.75 |
David F. Barrero | 2 | 120 | 17.17 |
David Camacho | 3 | 278 | 24.89 |