Title
Software defect prediction using doubly stochastic Poisson processes driven by stochastic belief networks.
Abstract
This research aims at better addressing the challenges related with software defect prediction.We develop a novel Bayesian inference approach driven from appropriate metrics.Formulation of our method is based on a doubly stochastic homogeneous Poisson process.Our model better learns from data with multiple modes in their distributions.We evaluate generalization across software classes, subsequent releases, and projects. Accurate prediction of software defects is of crucial importance in software engineering. Software defect prediction comprises two major procedures: (i) Design of appropriate software metrics to represent characteristic software system properties; and (ii) development of effective regression models for count data, allowing for accurate prediction of the number of software defects. Although significant research effort has been devoted to software metrics design, research in count data regression has been rather limited. More specifically, most used methods have not been explicitly designed to tackle the problem of metrics-driven software defect counts prediction, thus postulating irrelevant assumptions, such as (log-)linearity of the modeled data. In addition, a lack of simple and efficient algorithms for posterior computation has made more elaborate hierarchical Bayesian approaches appear unattractive in the context of software defect prediction. To address these issues, in this paper we introduce a doubly stochastic Poisson process for count data regression, the failure log-rate of which is driven by a novel latent space stochastic feedforward neural network. Our approach yields simple and efficient updates for its complicated conditional distributions by means of sampling importance resampling and error backpropagation. We exhibit the efficacy of our approach using publicly available and benchmark datasets.
Year
DOI
Venue
2016
10.1016/j.jss.2016.09.001
Journal of Systems and Software
Keywords
Field
DocType
Software defect prediction,Doubly stochastic Poisson process,Sampling importance resampling,Stochastic belief network
Data mining,Bayesian inference,Computer science,Software bug,Software system,Software,Artificial intelligence,Count data,Cox process,Software metric,Resampling,Machine learning
Journal
Volume
Issue
ISSN
122
C
0164-1212
Citations 
PageRank 
References 
4
0.39
27
Authors
2
Name
Order
Citations
PageRank
Andreas S. Andreou121636.65
Sotirios P. Chatzis225024.25