Abstract | ||
---|---|---|
As a high dimensional problem, analysis of largescale data sets is a challenging task, where many weakly relevant or redundant features hurt generalization performance of classification models. In order to solve this problem, many effective feature selection methods have proposed to eliminate redundant features in recent years. However, the comparative performances of these redundant feature detection based methods have not been reported yet, which makes the choice of feature selection method relatively difficult for many real applications. The paper presents a novel comparative study of redundant feature detection based feature selection methods. Experiments on several benchmark data sets demonstrate the comparative performances of some state-of-the-arts methods. Based on the extensive empirical results, the minimum Redundancy-Maximum Relevance (mRMR) method has been found to be the best one among all compared feature selection models. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/CITS.2014.6878974 | CITS |
Keywords | Field | DocType |
feature selection,pattern classification,redundancy,benchmark data sets,classification models,empirical analysis,generalization performance,high-dimensional problem,large-scale data set analysis,mrmr method,minimum redundancy-maximum relevance method,redundant feature detection-based feature selection method,comparative study,redundant feature | Data mining,High dimensional problem,Data set,Dimensionality reduction,Feature detection,Pattern recognition,Feature selection,Feature (computer vision),Computer science,Minimum redundancy feature selection,Artificial intelligence | Conference |
ISSN | Citations | PageRank |
2326-2338 | 0 | 0.34 |
References | Authors | |
13 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xue-qiang Zeng | 1 | 76 | 7.91 |
Qian-Sheng Chen | 2 | 0 | 0.34 |