Title :
The NCP algorithm of fuzzy source code comparison
Author :
Khaustov, Pavel A.
Author_Institution :
Department of Computer Engineering, National Research Tomsk Polytechnic University, Tomsk, Russia
Abstract :
The NCP algorithm is developed to detect the cases of plagiarism of source codes. Algorithm uses tokenize representation of the source code, trie-tree to store sequences of tokens and Levenshtein distance computation to assess the similarity of two sequences. Some optimizations are proposed to improve the performance of this algorithm.
Keywords :
C++ language; fuzzy set theory; optimisation; sequences; trees (mathematics); C++ language; Levenshtein distance computation; NCP algorithm; fuzzy source code comparison; no crib plus; optimizations; trie-tree; Abstracts; Approximation algorithms; Complexity theory; Educational institutions; Memory management; Optimization; Plagiarism; Levenshtein distanc; NCP-algorithm; NoCrib; Plagiarism; Source codes; Tokens; Trie-tree;
Conference_Titel :
Strategic Technology (IFOST), 2012 7th International Forum on
Conference_Location :
Tomsk
Print_ISBN :
978-1-4673-1772-6
DOI :
10.1109/IFOST.2012.6357644