Title | ||
---|---|---|
An Investigation Of Imbalanced Ensemble Learning Methods For Cross-Project Defect Prediction |
Abstract | ||
---|---|---|
Machine-learning-based software defect prediction (SDP) methods are receiving great attention from the researchers of intelligent software engineering. Most existing SDP methods are performed under a within-project setting. However, there usually is little to no within-project training data to learn an available supervised prediction model for a new SDP task. Therefore, cross-project defect prediction (CPDP), which uses labeled data of source projects to learn a defect predictor for a target project, was proposed as a practical SDP solution. In real CPDP tasks, the class imbalance problem is ubiquitous and has a great impact on performance of the CPDP models. Unlike previous studies that focus on subsampling and individual methods, this study investigated 15 imbalanced learning methods for CPDP tasks, especially for assessing the effectiveness of imbalanced ensemble learning (IEL) methods. We evaluated the 15 methods by extensive experiments on 31 open-source projects derived from five datasets. Through analyzing a total of 37504 results, we found that in most cases, the IEL method that combined under-sampling and bagging approaches will be more effective than the other investigated methods. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1142/S0218001419590377 | INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE |
Keywords | Field | DocType |
Intelligent software engineering, cross-project defect prediction, imbalanced learning, ensemble learning | Software bug,Cross project,Artificial intelligence,Ensemble learning,Mathematics,Machine learning | Journal |
Volume | Issue | ISSN |
33 | 12 | 0218-0014 |
Citations | PageRank | References |
1 | 0.35 | 3 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shaojian Qiu | 1 | 8 | 3.80 |
Lu Lu | 2 | 13 | 9.68 |
Siyu Jiang | 3 | 4 | 2.41 |
Yang Guo | 4 | 1 | 0.35 |