Title
An AST-based Code Plagiarism Detection Algorithm.
Abstract
In modern software engineering, software plagiarism is widespread and uncurbed, developing plagiarism detection methods is imperative. Popular technologies of software plagiarism detection are mostly based on text, token and syntax tree. Among these plagiarism detection technologies, tree-based plagiarism detection technology can effectively detect the code which cannot be detected by the other two kinds of technologies. In this paper, we propose a more effective plagiarism detection algorithm based on abstract syntax tree (AST) by computing the hash values of the syntax tree nodes, and comparing them. In order to implement the algorithm more effectively, special measurement is taken to reduce the error rate when calculating the hash values of operations, especially the arithmetic operations like subtraction and division. Results of the test showed that the measurement is reliable and necessary. It performs well in the code comparison field, and is helpful in the area of protecting source code's copyright.
Year
DOI
Venue
2015
10.1109/BWCCA.2015.52
BWCCA
Keywords
Field
DocType
plagiarism detection, tree-based technology, code comparison, abstract syntax tree, hash value
Plagiarism detection,Computer science,Source code,Word error rate,Algorithm,Abstract syntax tree,Software,Hash function,Subtraction,Security token
Conference
Citations 
PageRank 
References 
0
0.34
9
Authors
4
Name
Order
Citations
PageRank
Jingling Zhao100.34
Kunfeng Xia200.34
Yilun Fu301.01
Baojiang Cui411240.18