Title :
Code Comparison System based on Abstract Syntax Tree
Author :
Cui, Baojiang ; Li, Jiansong ; Guo, Tao ; Wang, Jianxin ; Ma, Ding
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Code comparison technology plays an important part in plagiarism detection and software evaluation. Software plagiarism mainly appears as copy-and-paste, sometimes followed by small modifications that do not change the function of the code, such as renaming methods or variables or reordering statements. This paper introduces a plagiarism detection tool named CCS (Code Comparison System), which is based on the Abstract Syntax Tree (AST). Exploiting the characteristics of the syntax trees, CCS calculates their hash values, transforms their storage forms, and then compares them node by node, which improves efficiency. Moreover, CCS preprocesses a large amount of source code in its database for potential use, which also accelerates plagiarism detection. CCS also takes special measures to reduce mistakes when calculating the hash values of operations such as subtraction and division. It performs well in the code comparison field and can help with copyright protection of source code.
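The abstract does not give the hashing scheme itself, so the following is only a minimal sketch of the general idea, not the CCS implementation. It assumes Python's standard `ast` module as the parser, and the names `subtree_hash`, `fingerprint`, and `COMMUTATIVE` are hypothetical. Identifiers are ignored so that renaming does not change a hash, operands of commutative operators are sorted, and operand order is kept for subtraction and division so that `x - 1` and `1 - x` are not confused. Statement reordering is not handled here.

```python
import ast
import hashlib

# Operators treated as order-insensitive in this sketch (an assumption;
# e.g. "+" on strings is not actually commutative).
COMMUTATIVE = (ast.Add, ast.Mult, ast.BitAnd, ast.BitOr, ast.BitXor)


def _digest(*parts: str) -> str:
    """Combine string parts into a short, deterministic hash."""
    return hashlib.sha256("|".join(parts).encode()).hexdigest()[:16]


def subtree_hash(node: ast.AST) -> str:
    """Hash an AST subtree bottom-up, ignoring identifier names."""
    if isinstance(node, ast.Name):
        # Drop the identifier so renamed variables hash identically.
        return _digest("Name")
    if isinstance(node, ast.Constant):
        return _digest("Const", repr(node.value))
    if isinstance(node, ast.BinOp):
        left = subtree_hash(node.left)
        right = subtree_hash(node.right)
        if isinstance(node.op, COMMUTATIVE):
            # a + b and b + a should match, so order the operand hashes.
            left, right = sorted((left, right))
        # For Sub, Div, etc. the operand order is preserved.
        return _digest("BinOp", type(node.op).__name__, left, right)
    # Generic case: node type plus the hashes of its children in order.
    children = [subtree_hash(child) for child in ast.iter_child_nodes(node)]
    return _digest(type(node).__name__, *children)


def fingerprint(source: str) -> str:
    """Parse source text and return the hash of its whole syntax tree."""
    return subtree_hash(ast.parse(source))


if __name__ == "__main__":
    print(fingerprint("x + 1") == fingerprint("1 + x"))          # commutative
    print(fingerprint("x - 1") == fingerprint("1 - x"))          # order matters
    print(fingerprint("total - 1") == fingerprint("count - 1"))  # renamed variable
```

Two fragments with equal fingerprints are candidates for a closer node-by-node comparison; storing the fingerprints of known code ahead of time corresponds to the preprocessing step the abstract mentions.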
Keywords :
computational linguistics; copyright; program compilers; program diagnostics; tree data structures; CCS; abstract syntax tree; code comparison system; plagiarism detection; software evaluation; source code; Syntactics; Hash value; code comparison; code plagiarism; syntax tree
Conference_Title :
Broadband Network and Multimedia Technology (IC-BNMT), 2010 3rd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6769-3
DOI :
10.1109/ICBNMT.2010.5705174