Abstract | ||
---|---|---|
Data clustering means to partition the samples in similar clusters; so that each cluster's samples have maximum similarity with each other and have a maximum distance from the samples of other clusters. Due to the problem of unsupervised clustering selection of a specific algorithm for clustering a set of unknown data is involved in much risk, and we usually fail to find the best option. Because of the complexity of the issue and inefficacy of basic clustering methods, most studies have been directed toward combined clustering methods. We name output partition of a clustering algorithm as a result. Diversity of the results of an ensemble of basic clusterings is one of the most important factors that can affect the quality of the final result. The quality of those results is another factor that affects the quality of the final result. Both factors considered in recent research of combined clustering. We propose a new framework to improve the efficiency of combined clustering that is based on selection of a subset of primary clusters. Selection of a Proper subset has a crucial role in the performance of our method. The selection is done using intelligent methods. The main ideas of the proposed method for selecting a subset of the clusters are to use the clusters that are stable. This process is done by the intelligent search algorithms. To assess the clusters, stability criteria based on mutual information has been used. At last, the selected clusters are going to be aggregated by some consensus functions. Experimental results on several standard datasets show that the proposed method can effectively improve the complete ensemble method. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/978-3-319-27060-9_40 | ADVANCES IN ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, MICAI 2015, PT I |
Keywords | Field | DocType |
Combined clustering,Cluster analysis,Mutual information,Genetic algorithm,Simulated annealing algorithm,Extended evidence accumulation clustering,Correlation matrix | Simulated annealing,Cluster (physics),Search algorithm,Pattern recognition,Computer science,Algorithm,Mutual information,Artificial intelligence,Covariance matrix,Cluster analysis,Partition (number theory),Genetic algorithm | Conference |
Volume | ISSN | Citations |
9413 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shahrbanoo Ahmadi | 1 | 0 | 0.34 |
Hamid Parvin | 2 | 263 | 41.94 |
Farhad Rad | 3 | 2 | 3.40 |