Title
NiCad+: Speeding the Detecting Process of NiCad
Abstract
With the development of the Internet and the construction of open source software communities, there has been a surge in open source software. Code Reuse—copy-past and modify open source code, which becomes a convenient choice for developers to save time and reduce labor costs. So there are more and more similar code fragments, code clones, in code project as a popular phenomenon. The code clone may import uncertainties into the program, which is a hot spot for urgent exploration. This paper summarized code clone detection tools and techniques in four categories at present and introduced one detection tool, NiCad, with high recall and precision. However, NiCad is not perfect for large-scale code clone detection scenarios, because NiCad is slow when dealing with large-scale of codes. Therefore, we speeded the detection process of NiCad, and and named the improved tool NiCad+. We greatly improved the efficiency of NiCad without effecting its recall and precision. The time-cost of detecting code clone was remarkable shortened by reducing the matching times. When testing with BigCloneEval, it only takes 28.43% time-cost as original NiCad. When testing with varying input sizes, the speeded detection process performs better than the original one from 10 KLoC (lines of code) to 5 MLoC.
Year
DOI
Venue
2020
10.1109/SOSE49046.2020.00019
2020 IEEE International Conference on Service Oriented Systems Engineering (SOSE)
Keywords
DocType
ISSN
code clone detection, NiCad, large scale, similarity, open source software
Conference
2640-8228
ISBN
Citations 
PageRank 
978-1-7281-6973-6
0
0.34
References 
Authors
24
6
Name
Order
Citations
PageRank
Chenhui Feng100.34
Tao Wang26123.52
Jinze Liu300.34
Yang Zhang400.34
Kele Xu54621.80
Yijie Wang623942.22