Title
Optimized Cutting Plane Algorithm for Large-Scale Risk Minimization
Abstract
We have developed an optimized cutting plane algorithm (OCA) for solving large-scale risk minimization problems. We prove that the number of iterations OCA requires to converge to a ε precise solution is approximately linear in the sample size. We also derive OCAS, an OCA-based linear binary Support Vector Machine (SVM) solver, and OCAM, a linear multi-class SVM solver. In an extensive empirical evaluation we show that OCAS outperforms current state-of-the-art SVM solvers like SVMlight, SVMperf and BMRM, achieving speedup factor more than 1,200 over SVMlight on some data sets and speedup factor of 29 over SVMperf, while obtaining the same precise support vector solution. OCAS, even in the early optimization steps, often shows faster convergence than the currently prevailing approximative methods in this domain, SGD and Pegasos. In addition, our proposed linear multi-class SVM solver, OCAM, achieves speedups of factor of up to 10 compared to SVMmulti-class. Finally, we use OCAS and OCAM in two real-world applications, the problem of human acceptor splice site detection and malware detection. Effectively parallelizing OCAS, we achieve state-of-the-art results on an acceptor splice site recognition problem only by being able to learn from all the available 50 million examples in a 12-million-dimensional feature space. Source code, data sets and scripts to reproduce the experiments are available at http://cmp.felk.cvut.cz/~xfrancv/ocas/html/.
Year
DOI
Venue
2009
10.1145/1577069.1755859
Journal of Machine Learning Research
Keywords
Field
DocType
oca-based linear binary support,speedup factor,acceptor splice site recognition,plane algorithm,human acceptor splice site,linear multi-class svm solver,current state-of-the-art svm solvers,iterations oca,proposed linear multi-class svm,large-scale risk minimization,derive ocas
Convergence (routing),Feature vector,Source code,Computer science,Support vector machine,Algorithm,Minification,Artificial intelligence,Solver,Machine learning,Speedup,Binary number
Journal
Volume
ISSN
Citations 
10,
1532-4435
29
PageRank 
References 
Authors
4.42
26
3
Name
Order
Citations
PageRank
Vojtěch Franc158455.78
Vojtěch Franc258455.78
Sören Sonnenburg31556125.12