Title
Imbalanced data classification using second-order cone programming support vector machines
Abstract
Learning from imbalanced data sets is an important machine learning challenge, especially in Support Vector Machines (SVM), where the assumption of equal cost of errors is made and each object is treated independently. Second-order cone programming SVM (SOCP-SVM) studies each class separately instead, providing quite an interesting formulation for the imbalanced classification task. This work presents a novel second-order cone programming (SOCP) formulation, based on the LP-SVM formulation principle: the bound of the VC dimension is loosened properly using the l"~-norm, and the margin is directly maximized using two margin variables associated with each class. A regularization parameter C is considered in order to control the trade-off between the maximization of these two margin variables. The proposed method has the following advantages: it provides better results, since it is specially designed for imbalanced classification, and it reduces computational complexity, since one conic restriction is eliminated. Experiments on benchmark imbalanced data sets demonstrate that our approach accomplishes the best classification performance, compared with the traditional SOCP-SVM formulation and with cost-sensitive formulations for linear SVM.
Year
DOI
Venue
2014
10.1016/j.patcog.2013.11.021
Pattern Recognition
Keywords
Field
DocType
lp-svm formulation principle,imbalanced classification,imbalanced classification task,cost-sensitive formulation,interesting formulation,benchmark imbalanced data set,margin variable,second-order cone programming support,vector machine,best classification performance,imbalanced data classification,imbalanced data set,traditional socp-svm formulation,support vector machines
Second-order cone programming,VC dimension,Pattern recognition,Support vector machine,Regularization (mathematics),Artificial intelligence,Data classification,Conic section,Mathematics,Maximization,Machine learning,Computational complexity theory
Journal
Volume
Issue
ISSN
47
5
0031-3203
Citations 
PageRank 
References 
24
0.71
30
Authors
2
Name
Order
Citations
PageRank
Sebastián Maldonado150832.45
Julio López212413.49