• DocumentCode
    2397429
  • Title

    Code Comparison System based on Abstract Syntax Tree

  • Author

    Cui, Baojiang ; Li, Jiansong ; Guo, Tao ; Wang, Jianxin ; Ma, Ding

  • Author_Institution
    Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2010
  • fDate
    26-28 Oct. 2010
  • Firstpage
    668
  • Lastpage
    673
  • Abstract
    The code comparison technology plays a very important part in the work of plagiarism detection and software evaluation. Software plagiarism mainly appears as copy-and-paste or with a little modification after this, which will not change the function of the code, such as replacing the name of methods or variables, reordering the sequence of the statements etc. This paper introduces a plagiarism detection tool named CCS (Code Comparison System) which is based on the Abstract Syntax Tree (AST). According to the syntax tree´s characteristics, CCS calculates their hash values, transforms their storage forms, and then compares them node by node. As a result, the efficiency improves. Moreover, CCS preprocesses a large amount of source code in its database for potential use, which also accelerate the course of plagiarism detection. CCS also takes special measurement to reduce mistakes when calculating the hash values of the operations like subtraction and division. It performs well in the code comparison field, and is able to help with the copyright protecting of the source code.
  • Keywords
    computational linguistics; copyright; program compilers; program diagnostics; tree data structures; CCS; abstract syntax tree; code comparison system; copyright; plagiarism detection; software evaluation; source code; Syntactics; CCS; Hash value; code comparison; code plagiarism; syntax tree;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Broadband Network and Multimedia Technology (IC-BNMT), 2010 3rd IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-6769-3
  • Type

    conf

  • DOI
    10.1109/ICBNMT.2010.5705174
  • Filename
    5705174