Title
A new feature selection method based on clustering.
Abstract
Feature selection is an effective technique to put the high dimension of data down, which is prevailing in many application domains, such as text categorization and bio-informatics, and can bring many advantages, such as improving efficiency and avoiding over-fitting, to learning algorithms. Currently, many efforts have been attempted in this field and various feature selection methods have been developed and proved to be very competitive. Unlike other selection methods, in this paper we propose a new method to select important features using a manner of feature clustering. The main character of our method is that it works like data clustering in an agglomerative way. In this method, each feature is considered as a data point clustered with between-cluster and within-cluster distances. As a result, the selected feature subset has minimal redundancy among its members and maximal relevance with the class labels. Our performance evaluations on seven benchmark datasets show that the classification performance achieved by our proposed method is better than other feature selection methods. © 2011 IEEE.
Year
DOI
Venue
2011
10.1109/FSKD.2011.6019687
FSKD
Keywords
Field
DocType
machine learning,pattern recognition,learning artificial intelligence,accuracy,measurement,mutual information,feature selection,bioinformatics,data clustering,redundancy,clustering algorithms
Data mining,Feature selection,Computer science,Feature (machine learning),Artificial intelligence,Cluster analysis,Single-linkage clustering,Pattern recognition,Correlation clustering,Feature (computer vision),Feature extraction,Feature learning,Machine learning
Conference
Volume
Issue
Citations 
2
null
0
PageRank 
References 
Authors
0.34
19
4
Name
Order
Citations
PageRank
Huawen Liu146827.18
Yuchang Mo212310.63
Ji-yi Wang3178.05
Jianming Zhao4135.25