Title
Feature Selection Based on Pairwise Classification Performance
Abstract
The process of feature selection is an important first step in building machine learning models. Feature selection algorithms can be grouped into wrappers and filters; the former use machine learning models to evaluate feature sets, the latter use other criteria to evaluate features individually. We present a new approach to feature selection that combines advantages of both wrapper as well as filter approaches, by using logistic regression and the area under the ROC curve (AUC) to evaluate pairs of features. After choosing as starting feature the one with the highest individual discriminatory power, we incrementally rank features by choosing as next feature the one that achieves the highest AUC in combination with an already chosen feature. To evaluate our approach, we compared it to standard filter and wrapper algorithms. Using two data sets from the biomedical domain, we are able to demonstrate that the performance of our approach exceeds that of filter methods, while being comparable to wrapper methods at smaller computational cost.
Year
DOI
Venue
2009
10.1007/978-3-642-04772-5_99
EUROCAST
Keywords
Field
DocType
feature selection,filter method,feature ranking,next feature,pairwise evaluation.,former use machine,new approach,pairwise classification performance,standard filter,wrapper algorithm,feature set,feature selection algorithm,rank feature,machine learning,logistic regression,roc curve
Data mining,Dimensionality reduction,Feature selection,Computer science,Feature (machine learning),Artificial intelligence,k-nearest neighbors algorithm,Pairwise comparison,Feature vector,Pattern recognition,Feature extraction,Machine learning,Feature learning
Conference
Volume
ISSN
Citations 
5717
0302-9743
3
PageRank 
References 
Authors
0.47
11
2
Name
Order
Citations
PageRank
Stephan Dreiseitl133834.80
Melanie Osl2716.83