Title
On The Classification Of Bug Reports To Improve Bug Localization
Abstract
Bug localization is the automated process of finding the possible faulty files in a software project. Bug localization allows developers to concentrate on vital files. Information retrieval (IR)-based approaches have been proposed to assist automatically identify software defects by using bug report information. However, some bug reports that are not semantically related to the relevant code are not helpful to IR-based systems. Running an IR-based reporting system can lead to false-positive results. In this paper, we propose a classification model for classifying a bug report as either uninformative or informative. Our approach helps to lower false positives and increase ranking performances by filtering uninformative information before running an IR-based bug location system. The model is based on implicit features learned from bug reports that use neural networks and explicit features defined manually. We test our proposed model on three open-source software projects that contain over 9000 bug reports. The results of the evaluation show that our model enhances the efficiency of a developed IR-based system in the trade-off between precision and recall. For implicit features, our tests with comparisons show that the LSTM network performs better than the CNN and multilayer perceptron with respect to the F-measurements. Combining both implicit and explicit features outperforms using only implicit features. Our classification model helps improve precision in bug localization tasks when precision is considered more important than recall.
Year
DOI
Venue
2021
10.1007/s00500-021-05689-2
SOFT COMPUTING
Keywords
DocType
Volume
Bug classification, Bug localization, Bug report quality, Machine learning
Journal
25
Issue
ISSN
Citations 
11
1432-7643
2
PageRank 
References 
Authors
0.37
0
6
Name
Order
Citations
PageRank
Fan Fang120.37
John Wu220.37
Yanyan Li320.37
Xin Ye41776.08
Wajdi Aljedaani5122.21
Mohamed Wiem Mkaouer622828.58