Title
A cost-sensitive multi-criteria quadratic programming model for imbalanced data.
Abstract
Multiple Criteria Quadratic Programming (MCQP), a mathematical programming-based classification method, has been developed recently and proved to be effective and scalable. However, its performance degraded when learning from imbalanced data. This paper proposes a cost-sensitive MCQP (CS-MCQP) model by introducing the cost of misclassifications to the MCQP model. The empirical tests were designed to compare the proposed model with MCQP and a selection of classifiers on 26 imbalanced datasets from the UCI repositories. The results indicate that the CS-MCQP model not only performs better than the optimization-based models (MCQP and SVM), but also outperforms the selected classifiers, ensemble, preprocessing techniques and hybrid methods on imbalanced datasets in terms of AUC and GeoMean measures. To validate the results statistically, Student’s t test and Wilcoxon signed-rank test were conducted and show that the superiority of CS-MCQP is statistically significant with significance level 0.05. In addition, we analyze the effect of noisy, small disjunct and overlapping data properties on the proposed model and conclude that the CS-MCQP model achieves better performance on imbalanced data with overlapping feature than noisy and small disjunct data.
Year
DOI
Venue
2018
10.1057/s41274-017-0233-4
JORS
Keywords
Field
DocType
class imbalance problem, cost-sensitive classification, multiple criteria decision making, multi-criteria quadratic programming
Data mining,Multiple criteria,Computer science,Support vector machine,Wilcoxon signed-rank test,Preprocessor,Artificial intelligence,Quadratic programming,Machine learning,Scalability
Journal
Volume
Issue
ISSN
69
4
0160-5682
Citations 
PageRank 
References 
2
0.38
34
Authors
2
Name
Order
Citations
PageRank
Xiangrui Chao1172.58
Yi Peng2130378.20