Title
Automatic, high accuracy prediction of reopened bugs
Abstract
Bug fixing is one of the most time-consuming and costly activities of the software development life cycle. In general, bugs are reported in a bug tracking system, validated by a triage team, assigned to someone to fix, and finally verified and closed. However, in some cases bugs have to be reopened. Reopened bugs increase software maintenance cost, cause rework for already busy developers, and in some cases even delay the future delivery of a software release. Therefore, a few recent studies have focused on reopened bugs. However, these prior studies did not achieve high performance (in terms of precision and recall), required manual intervention, and used very simplistic techniques when dealing with textual data, which leads us to believe that further improvements are possible. In this paper, we propose ReopenPredictor, an automatic, high accuracy predictor of reopened bugs. ReopenPredictor uses a number of features, including textual features, to achieve high accuracy prediction of reopened bugs. As part of ReopenPredictor, we propose two algorithms that automatically estimate various thresholds to maximize the prediction performance. To examine the benefits of ReopenPredictor, we perform experiments on three large open source projects, namely Eclipse, Apache HTTP, and OpenOffice. Our results show that ReopenPredictor outperforms prior work, achieving a reopened F-measure of 0.744, 0.770, and 0.860 for Eclipse, Apache HTTP, and OpenOffice, respectively. These results correspond to improvements of 33.33%, 12.57%, and 3.12% over the reopened F-measure of the method proposed in prior work by Shihab et al. for Eclipse, Apache HTTP, and OpenOffice, respectively.
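The abstract reports results as the F-measure of the reopened class and mentions automatically estimated thresholds. As a rough illustration only, the sketch below computes the F-measure for the positive (reopened) class and picks, from a list of candidate cut-offs, the one that maximizes it. This is a generic threshold sweep, not the two algorithms proposed in the paper; the function names `f_measure` and `best_threshold`, the score scale, and the candidate list are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple


def f_measure(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F-measure of the positive class (1 = reopened, 0 = not reopened)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def best_threshold(
    y_true: Sequence[int],
    scores: Sequence[float],
    candidates: Sequence[float],
) -> Tuple[float, float]:
    """Return (threshold, F-measure) for the candidate with the highest F-measure.

    Hypothetical helper: a bug is predicted as reopened when its score is
    greater than or equal to the threshold. Ties keep the earliest candidate.
    """
    best_th, best_f = candidates[0], -1.0
    for th in candidates:
        preds: List[int] = [1 if s >= th else 0 for s in scores]
        f = f_measure(y_true, preds)
        if f > best_f:
            best_th, best_f = th, f
    return best_th, best_f


if __name__ == "__main__":
    # Toy labels and scores, invented for illustration only.
    labels = [1, 0, 1, 0, 0, 1]
    scores = [0.9, 0.4, 0.35, 0.2, 0.1, 0.8]
    print(best_threshold(labels, scores, [0.3, 0.5, 0.7]))
```

In practice the threshold would be chosen on training or validation data rather than on the data used to report the final F-measure, since reopened bugs are the minority class and the choice of cut-off strongly affects precision and recall.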
Year: 2015
DOI: 10.1007/s10515-014-0162-2
Venue: Automated Software Engineering
Keywords: Reopened bugs, Imbalanced feature selection, Imbalanced learning
Field: Data mining, Computer science, Theoretical computer science, Eclipse, Artificial intelligence, Software maintenance, Software release life cycle, Rework, Precision and recall, Bug tracking system, Triage, Systems development life cycle, Machine learning
DocType: Journal
Volume: 22
Issue: 1
ISSN: 0928-8910
Citations: 35
PageRank: 0.82
References: 48
Authors: 5
Name          Order  Citations  PageRank
Xin Xia       1      972        65.97
David Lo      2      5346       259.67
Emad Shihab   3      1229       54.74
xinyu         4      590        30.19
Bo Zhou       5      241        12.42