Title
Simultaneous Feature Selection and Cluster Analysis Using Genetic Algorithm
Abstract
Cluster analysis being one of the important techniques of data mining applied in several fields such as bioinformatics, social networks, computer vision, and so on. It is an unsupervised learning technique for exploring the structure of the data without class label. Many clustering algorithms have been proposed to analyze high volume of data, but very few of them evaluate the quality of the clusters due to irrelevant and inconsistent features present in the dataset. So, feature selection is an important pre-processing step in data analysis mainly for high dimensional dataset. In the paper, we select optimal subset of features and perform clusters analysis simultaneously using genetic algorithm. Basically, genetic algorithm is used to select the optimal subset of features which automatically finds optimal number of clusters sat the end of the process. Optimality of the clusters is measured by calculating various cluster validation indices. The overall performance of the method is investigated on popular UCI datasets and the experimental results are compared with Fuzzy C-Means algorithm to demonstrate effectiveness of the proposed method.
Year
DOI
Venue
2016
10.1109/ICIT.2016.064
2016 International Conference on Information Technology (ICIT)
Keywords
Field
DocType
Feature Selection,Cluster Analysis,Cluster validation index,Genetic Algorithm
k-medians clustering,Data mining,k-means clustering,Algorithm design,Pattern recognition,Feature selection,Computer science,Fuzzy logic,Unsupervised learning,Artificial intelligence,Cluster analysis,Genetic algorithm
Conference
ISBN
Citations 
PageRank 
978-1-5090-3585-4
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Sunanda Das1211.96
Shreya Chaudhuri200.68
Sujata Ghatak300.34
Asit Kumar Das47316.06