Abstract | ||
---|---|---|
Cluster analysis being one of the important techniques of data mining applied in several fields such as bioinformatics, social networks, computer vision, and so on. It is an unsupervised learning technique for exploring the structure of the data without class label. Many clustering algorithms have been proposed to analyze high volume of data, but very few of them evaluate the quality of the clusters due to irrelevant and inconsistent features present in the dataset. So, feature selection is an important pre-processing step in data analysis mainly for high dimensional dataset. In the paper, we select optimal subset of features and perform clusters analysis simultaneously using genetic algorithm. Basically, genetic algorithm is used to select the optimal subset of features which automatically finds optimal number of clusters sat the end of the process. Optimality of the clusters is measured by calculating various cluster validation indices. The overall performance of the method is investigated on popular UCI datasets and the experimental results are compared with Fuzzy C-Means algorithm to demonstrate effectiveness of the proposed method. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/ICIT.2016.064 | 2016 International Conference on Information Technology (ICIT) |
Keywords | Field | DocType |
Feature Selection,Cluster Analysis,Cluster validation index,Genetic Algorithm | k-medians clustering,Data mining,k-means clustering,Algorithm design,Pattern recognition,Feature selection,Computer science,Fuzzy logic,Unsupervised learning,Artificial intelligence,Cluster analysis,Genetic algorithm | Conference |
ISBN | Citations | PageRank |
978-1-5090-3585-4 | 0 | 0.34 |
References | Authors | |
0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sunanda Das | 1 | 21 | 1.96 |
Shreya Chaudhuri | 2 | 0 | 0.68 |
Sujata Ghatak | 3 | 0 | 0.34 |
Asit Kumar Das | 4 | 73 | 16.06 |