Title
Local versus Global Models for Just-In-Time Software Defect Prediction
Abstract
AbstractJust-in-time software defect prediction (JIT-SDP) is an active topic in software defect prediction, which aims to identify defect-inducing changes. Recently, some studies have found that the variability of defect data sets can affect the performance of defect predictors. By using local models, it can help improve the performance of prediction models. However, previous studies have focused on module-level defect prediction. Whether local models are still valid in the context of JIT-SDP is an important issue. To this end, we compare the performance of local and global models through a large-scale empirical study based on six open-source projects with 227417 changes. The experiment considers three evaluation scenarios of cross-validation, cross-project-validation, and timewise-cross-validation. To build local models, the experiment uses the k-medoids to divide the training set into several homogeneous regions. In addition, logistic regression and effort-aware linear regression (EALR) are used to build classification models and effort-aware prediction models, respectively. The empirical results show that local models perform worse than global models in the classification performance. However, local models have significantly better effort-aware prediction performance than global models in the cross-validation and cross-project-validation scenarios. Particularly, when the number of clusters k is set to 2, local models can obtain optimal effort-aware prediction performance. Therefore, local models are promising for effort-aware JIT-SDP.
Year
DOI
Venue
2019
10.1155/2019/2384706
Periodicals
Field
DocType
Volume
Training set,Data set,Homogeneous,Computer science,Software bug,Speech recognition,Artificial intelligence,Predictive modelling,Logistic regression,Empirical research,Machine learning,Linear regression
Journal
2019
Issue
ISSN
Citations 
1
1058-9244
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Xingguang Yang102.03
Huiqun Yu219136.27
Guisheng Fan39125.45
Kai Shi4113.47
Liqiong Chen57519.61