Title
Geometric Structural Ensemble Learning for Imbalanced Problems.
Abstract
The classification on imbalanced data sets is a great challenge in machine learning. In this paper, a geometric structural ensemble (GSE) learning framework is proposed to address the issue. It is known that the traditional ensemble methods train and combine a series of basic classifiers according to various weights, which might lack the geometric meaning. Oppositely, the GSE partitions and eliminates redundant majority samples by generating hyper-sphere through the Euclidean metric and learns basic classifiers to enclose the minority samples, which achieves higher efficiency in the training process and seems easier to understand. In detail, the current weak classifier builds boundaries between the majority and the minority samples and removes the former. Then, the remaining samples are used to train the next. When the training process is done, all of the majority samples could be cleaned and the combination of all basic classifiers is obtained. To further improve the generalization, two relaxation techniques are proposed. Theoretically, the computational complexity of GSE could approach <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$ {O(nd\log (n_{\min })\log (n_{\mathrm{ maj}}))}$ </tex-math></inline-formula> . The comprehensive experiments validate both the effectiveness and efficiency of GSE.
Year
DOI
Venue
2020
10.1109/TCYB.2018.2877663
IEEE transactions on cybernetics
Keywords
Field
DocType
Training,Bagging,Boosting,Cybernetics,Optimization,Chemical processes
Data set,Euclidean distance,Artificial intelligence,Classifier (linguistics),Ensemble learning,Machine learning,Mathematics,Computational complexity theory
Journal
Volume
Issue
ISSN
50
4
2168-2267
Citations 
PageRank 
References 
5
0.42
0
Authors
5
Name
Order
Citations
PageRank
Zonghai Zhu1113.54
Zhe Wang25020.04
Dongdong Li3158.34
Yujin Zhu4365.28
Wenli Du517930.50