Title
Sequential Alternating Proximal Method for Scalable Sparse Structural SVMs
Abstract
Structural Support Vector Machines (SSVMs) have recently gained wide prominence in classifying structured and complex objects like parse-trees, image segments and Part-of-Speech (POS) tags. Typical learning algorithms used in training SSVMs result in model parameters which are vectors residing in a large-dimensional feature space. Such a high-dimensional model parameter vector contains many non-zero components which often lead to slow prediction and storage issues. Hence there is a need for sparse parameter vectors which contain a very small number of non-zero components. L1-regularizer and elastic net regularizer have been traditionally used to get sparse model parameters. Though L1-regularized structural SVMs have been studied in the past, the use of elastic net regularizer for structural SVMs has not been explored yet. In this work, we formulate the elastic net SSVM and propose a sequential alternating proximal algorithm to solve the dual formulation. We compare the proposed method with existing methods for L1-regularized Structural SVMs. Experiments on large-scale benchmark datasets show that the proposed dual elastic net SSVM trained using the sequential alternating proximal algorithm scales well and results in highly sparse model parameters while achieving a comparable generalization performance. Hence the proposed sequential alternating proximal algorithm is a competitive method to achieve sparse model parameters and a comparable generalization performance when elastic net regularized Structural SVMs are used on very large datasets.
Year
DOI
Venue
2012
10.1109/ICDM.2012.81
ICDM
Keywords
Field
DocType
high-dimensional model parameter vector,scalable sparse structural svms,sequential alternating proximal method,proximal algorithm,structural svms,model parameter,sparse model parameter,comparable generalization performance,l1-regularized structural svms,sparse parameter vector,non-zero component,regularized structural svms,support vector machines,learning artificial intelligence
Small number,Data mining,Feature vector,Pattern recognition,Computer science,Sparse model,Elastic net regularization,Support vector machine,Artificial intelligence,Model parameter,Machine learning,Scalability
Conference
ISSN
Citations 
PageRank 
1550-4786
3
0.41
References 
Authors
7
3
Name
Order
Citations
PageRank
P. Balamurugan1132.92
Shirish Shevade2615.35
T. Ravindra Babu3576.26