Title
Can We Detect Bug Report Duplication with Unfinished Bug Reports?
Abstract
It is useful if a bug tracking system can detect bug report duplication with unfinished bug reports. To investigate the feasibility, we study relations between accuracy of duplicate bug report detection using features extracted from textual information in bug reports and the number of words in bug reports in this paper. The results show that increasing the number of words to be used in duplicate detection over a certain number does not affect the accuracy very much. The results also indicate that we had better use about 100 and 80 words in Eclipse and OpenOffice, respectively, in the detection because we may have many wrong candidates of duplication if we use words of more than the numbers. We thus think that detecting bug duplication in writing a new bug report has potential of giving duplicate bug report candidates.
Year
DOI
Venue
2015
10.1109/APSEC.2015.33
2015 Asia-Pacific Software Engineering Conference (APSEC)
Keywords
Field
DocType
Duplicate Bug Report,Bug Tracking System,Free/Open Source Software Development,Textual Similarity,Bugzilla,TakeLab
Data mining,Duplicate detection,Information retrieval,Computer science,Textual information,Software bug,Bug tracking system,Feature extraction,Real-time computing,Software,Eclipse
Conference
ISSN
Citations 
PageRank 
1530-1362
0
0.34
References 
Authors
13
3
Name
Order
Citations
PageRank
Akihiro Tsuruda100.34
Yuki Manabe272.52
Masayoshi Aritsugi310951.51