Abstract | ||
---|---|---|
In many classification problems, and in particular in medical domains, it is common to have an unbalanced class distribution. This pose problems to classifiers as they tend to perform poorly in the minority class which is often the class of interest. One commonly used strategy that to improve the classification performance is to select a subset of relevant features. Feature selection algorithms, however, have not been designed to favour the classification performance of the minority class. In this paper, we present a novel filter feature selection algorithm, called FSMC, for unbalanced data sets. FSMC selects attributes that have minority class distributions significantly different from the majority class distributions. FSMC is fast, simple, selects a small number of features and outperforms in most cases other feature selection algorithms in terms of global accuracy and in terms of performance measures for the minority class such as precision, recall, F-measure and ROC values. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1007/978-3-642-25085-9_49 | CIARP |
Keywords | Field | DocType |
relevant feature,minority class,unbalanced class distribution,minority class feature selection,feature selection algorithm,novel filter feature selection,minority class distribution,performance measure,classification problem,majority class distribution,classification performance,feature selection | Small number,Data set,Pattern recognition,Feature selection,Computer science,Artificial intelligence,Recall,Machine learning | Conference |
Volume | ISSN | Citations |
7042 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 10 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
German Cuaya | 1 | 9 | 1.06 |
Angélica Muñoz-Meléndez | 2 | 44 | 10.51 |
Eduardo F. Morales | 3 | 559 | 57.67 |