Title
Manifold cluster-based evolutionary ensemble imbalance learning
Abstract
AbstractHighlights •Class imbalance learning is done on manifold space to identify overlapping data.•Manifold clustering-based resampling strategy is proposed to avoid useless sample.•Each sub-cluster in MECS-Ensemble has distinct size.•The number of samples removed or generated from a sub-cluster adaptively adjusted.•A fitness function is designed to comprehensively evaluate a classifier structure. AbstractFor an imbalanced dataset, traditional machine learning methods usually misclassify minority samples due to the indicator evaluating classification accuracy biased toward majority class. To address the issue, manifold cluster-based evolutionary ensemble imbalance learning is proposed, with the purpose of providing a more effective framework for building an optimal imbalance classifier. After mapping the original data to manifold space, majority samples are removed from each sub-cluster in terms of their distribution characteristic. Following that, a new one is generated in each minority sub-cluster by over-sampling, with the purpose of avoiding a misclassified new minority sample that produced from small disjuncts. In above manifold clustering-based resampling techniques, optional operations and key parameters for normalization, manifold learning, clustering, under-sampling and over-sampling form various combination. Thus, evolutionary algorithm is introduced to seek the optimal structure for MECS-Ensemble. Each individual is encoded by five integer and six real number, and a fitness function is designed to evaluate its classification accuracy and the diversity of majority samples. The statistical experimental results for 39 imbalanced datasets show that MECS-Ensemble proposed in the paper is superior to the other imbalance learning methods, especially, manifold clustering-based resampling technique contributes to significant performance improvements.
Year
DOI
Venue
2021
10.1016/j.cie.2021.107523
Periodicals
Keywords
DocType
Volume
Class-imbalance, Manifold learning, Clustering, Ensemble learning, Evolutionary algorithm
Journal
159
Issue
ISSN
Citations 
C
0360-8352
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Yinan Guo100.34
Jiawei Feng200.34
Botao Jiao300.34
Linkai Yang400.34
Hui Lu500.34
Zekuan Yu602.70