Title
An Investigation Of Imbalanced Ensemble Learning Methods For Cross-Project Defect Prediction
Abstract
Machine-learning-based software defect prediction (SDP) methods are receiving great attention from the researchers of intelligent software engineering. Most existing SDP methods are performed under a within-project setting. However, there usually is little to no within-project training data to learn an available supervised prediction model for a new SDP task. Therefore, cross-project defect prediction (CPDP), which uses labeled data of source projects to learn a defect predictor for a target project, was proposed as a practical SDP solution. In real CPDP tasks, the class imbalance problem is ubiquitous and has a great impact on performance of the CPDP models. Unlike previous studies that focus on subsampling and individual methods, this study investigated 15 imbalanced learning methods for CPDP tasks, especially for assessing the effectiveness of imbalanced ensemble learning (IEL) methods. We evaluated the 15 methods by extensive experiments on 31 open-source projects derived from five datasets. Through analyzing a total of 37504 results, we found that in most cases, the IEL method that combined under-sampling and bagging approaches will be more effective than the other investigated methods.
Year
DOI
Venue
2019
10.1142/S0218001419590377
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE
Keywords
Field
DocType
Intelligent software engineering, cross-project defect prediction, imbalanced learning, ensemble learning
Software bug,Cross project,Artificial intelligence,Ensemble learning,Mathematics,Machine learning
Journal
Volume
Issue
ISSN
33
12
0218-0014
Citations 
PageRank 
References 
1
0.35
3
Authors
4
Name
Order
Citations
PageRank
Shaojian Qiu183.80
Lu Lu2139.68
Siyu Jiang342.41
Yang Guo410.35