Abstract | ||
---|---|---|
AbstractHighlights •Class imbalance learning is done on manifold space to identify overlapping data.•Manifold clustering-based resampling strategy is proposed to avoid useless sample.•Each sub-cluster in MECS-Ensemble has distinct size.•The number of samples removed or generated from a sub-cluster adaptively adjusted.•A fitness function is designed to comprehensively evaluate a classifier structure. AbstractFor an imbalanced dataset, traditional machine learning methods usually misclassify minority samples due to the indicator evaluating classification accuracy biased toward majority class. To address the issue, manifold cluster-based evolutionary ensemble imbalance learning is proposed, with the purpose of providing a more effective framework for building an optimal imbalance classifier. After mapping the original data to manifold space, majority samples are removed from each sub-cluster in terms of their distribution characteristic. Following that, a new one is generated in each minority sub-cluster by over-sampling, with the purpose of avoiding a misclassified new minority sample that produced from small disjuncts. In above manifold clustering-based resampling techniques, optional operations and key parameters for normalization, manifold learning, clustering, under-sampling and over-sampling form various combination. Thus, evolutionary algorithm is introduced to seek the optimal structure for MECS-Ensemble. Each individual is encoded by five integer and six real number, and a fitness function is designed to evaluate its classification accuracy and the diversity of majority samples. The statistical experimental results for 39 imbalanced datasets show that MECS-Ensemble proposed in the paper is superior to the other imbalance learning methods, especially, manifold clustering-based resampling technique contributes to significant performance improvements. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1016/j.cie.2021.107523 | Periodicals |
Keywords | DocType | Volume |
Class-imbalance, Manifold learning, Clustering, Ensemble learning, Evolutionary algorithm | Journal | 159 |
Issue | ISSN | Citations |
C | 0360-8352 | 0 |
PageRank | References | Authors |
0.34 | 0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yinan Guo | 1 | 0 | 0.34 |
Jiawei Feng | 2 | 0 | 0.34 |
Botao Jiao | 3 | 0 | 0.34 |
Linkai Yang | 4 | 0 | 0.34 |
Hui Lu | 5 | 0 | 0.34 |
Zekuan Yu | 6 | 0 | 2.70 |