Title
A joint distribution matching model for distribution-adaptation-based cross-project defect prediction
Abstract
Using classification methods to predict software defect is receiving a great deal of attention and most of the existing studies primarily conduct prediction under the within-project setting. However, there usually had no or very limited labelled data to train an effective prediction model at an early phase of the software lifecycle. Thus, cross-project defect prediction (CPDP) is proposed as an alternative solution, which is learning a defect predictor for a target project by using labelled data from a source project. Differing from previous CPDP methods that mainly apply instances selection and classifiers adjustment to improve the performance, in this study, the authors put forward a novel distribution–adaptation-based CPDP approach, joint distribution matching (JDM). Specifically, JDM aims to minimise the joint distribution divergence between the source and target project to improve the CPDP performance. By constructing an adaptive weight vector for the instances of the source project, JDM can be effective and robust at reducing marginal distribution discrepancy and conditional distribution discrepancy simultaneously. Extensive experiments verify that JDM can outperform related distribution–adaptation-based methods on 15 open-source projects that are derived from two types of repositories.
Year
DOI
Venue
2019
10.1049/iet-sen.2018.5131
IET Software
Keywords
Field
DocType
pattern classification,learning (artificial intelligence),vectors
Data mining,Joint probability distribution,Conditional probability distribution,Computer science,Software bug,Weight,Real-time computing,Cross project,Software development process,Marginal distribution
Journal
Volume
Issue
ISSN
13
5
1751-8806
Citations 
PageRank 
References 
1
0.35
0
Authors
3
Name
Order
Citations
PageRank
Shaojian Qiu183.80
Lu Lu2139.68
Siyu Jiang342.41