• DocumentCode
    3757273
  • Title

    An AST-based Code Plagiarism Detection Algorithm

  • Author

    Jingling Zhao;Kunfeng Xia;Yilun Fu;Baojiang Cui

  • Author_Institution
    Sch. of Comput. Sci., Beijing Univ. of Posts &
  • fYear
    2015
  • Firstpage
    178
  • Lastpage
    182
  • Abstract
    In modern software engineering, software plagiarism is widespread and uncurbed, developing plagiarism detection methods is imperative. Popular technologies of software plagiarism detection are mostly based on text, token and syntax tree. Among these plagiarism detection technologies, tree-based plagiarism detection technology can effectively detect the code which cannot be detected by the other two kinds of technologies. In this paper, we propose a more effective plagiarism detection algorithm based on abstract syntax tree (AST) by computing the hash values of the syntax tree nodes, and comparing them. In order to implement the algorithm more effectively, special measurement is taken to reduce the error rate when calculating the hash values of operations, especially the arithmetic operations like subtraction and division. Results of the test showed that the measurement is reliable and necessary. It performs well in the code comparison field, and is helpful in the area of protecting source code´s copyright.
  • Keywords
    "Syntactics","Plagiarism","Software","Software algorithms","Algorithm design and analysis","Semantics","Feature extraction"
  • Publisher
    ieee
  • Conference_Titel
    Broadband and Wireless Computing, Communication and Applications (BWCCA), 2015 10th International Conference on
  • Type

    conf

  • DOI
    10.1109/BWCCA.2015.52
  • Filename
    7424821