Abstract | ||
---|---|---|
Area Under the ROC Curve (AUC) is an objective indicator of evaluating classification performance for imbalanced data. In order to deal with large-scale imbalanced streaming data, especially high-dimensional sparse data, this paper proposes a Sparse Stochastic Online AUC Optimization (SSOAO) method. Specifically, we first turn the standard online AUC optimization problem into a stochastic saddle point problem, then optimizing AUC by solving stochastic saddle point problem through AdaGrad optimizer. A sparse regularization term is also added for learning sparse data with high dimension. Comprehensive evaluation has been carried out on the recent benchmark. The experimental results show that the proposed SSOAO has the comparable performance on low-dimensional data, and outperforms other popular AUC optimization methods on high-dimensional sparse imbalanced streaming data. Both time and space complexity for model updating are reduced from O(d(2)) to O(d), which equal to the data dimension. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1007/978-3-319-77383-4_94 | ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II |
Keywords | Field | DocType |
Online learning,AUC optimization,Imbalanced data | Online learning,Saddle point,Pattern recognition,Computer science,Algorithm,Regularization (mathematics),Artificial intelligence,Streaming data,Area under the roc curve,Optimization problem,Sparse matrix | Conference |
Volume | ISSN | Citations |
10736 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 4 | 5 |