Title
Error mining: bug detection through comparison with large code databases
Abstract
Bugs are hard to find. Static analysis tools are capable of systematically detecting predefined sets of errors, but extending them to find new error types requires a deep understanding of the underlying programming language. Manual reviews on the other hand, while being able to reveal more individual errors, require much more time. We present a new approach to automatically detect bugs through comparison with a large code database. The source file is analyzed for similar but slightly different code fragments in the database. Frequent occurrences of common differences indicate a potential bug that can be fixed by applying the modification back to the original source file. In this paper, we give an overview of the resulting algorithm and some important implementation details. We further evaluate the circumstances under which good detection rates can be achieved. The results demonstrate that consistently high detection rates of up to 50% are possible for certain error types across different programming languages.
Year
DOI
Venue
2012
10.1109/MSR.2012.6224278
MSR
Keywords
Field
DocType
code databases,code similarity,comparison-based bug detection,static analysis,computer bugs,programming languages,cloning,computer languages,programming language,databases,data mining,software engineering,database management systems,static analysis tools
Static program analysis,Data mining,Computer science,Source code,Static analysis,Database
Conference
ISBN
Citations 
PageRank 
978-1-4673-1761-0
1
0.36
References 
Authors
8
1
Name
Order
Citations
PageRank
Alexander Breckel151.14