Title :
A New Similarity Measure Based on Weibull Distribution for Detecting Plagiarized Source Codes
Author :
Ji, Jeong-Hoon ; Woo, Gyun ; Cho, Hwan-Gue
Author_Institution :
Dept. of Comput. Eng., Pusan Nat. Univ., Pusan
Abstract :
Most previously released plagiarism detection tools and systems have adopted normalized similarity measures, which are unreliably sensitive to the size of the programs being compared. Most of these tools also have difficulty determining the cutoff threshold that discriminates plagiarized code from innocent code. In this paper, we present a new discriminating method based on the Weibull distribution, which has mainly been used to study genomic sequence similarity with statistical significance. We applied our new detection method to a real programming competition, the ICPC East Asia Regional Contest. Our system successfully detected a few plagiarized codes in the preliminary round of the contest. This experience clearly revealed the characteristics of similarity among the source codes submitted in programming contests.
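The abstract does not give the paper's scoring function or fitted parameters, so the following is only a minimal sketch of how a Weibull-based cutoff could work in principle. It assumes that the similarity scores of unrelated program pairs follow a two-parameter Weibull distribution whose shape and scale are already known. The function names, the parameter values, and the significance level alpha are hypothetical and are not taken from the paper.

```python
import math


def weibull_survival(x, shape, scale):
    """Upper-tail probability P(X > x) of a two-parameter Weibull distribution."""
    if x <= 0:
        return 1.0
    return math.exp(-((x / scale) ** shape))


def weibull_cutoff(alpha, shape, scale):
    """Score whose upper-tail probability equals alpha (inverse of the survival function)."""
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    return scale * (-math.log(alpha)) ** (1.0 / shape)


def flag_suspicious(scores, shape, scale, alpha=0.001):
    """Return the pairs whose score exceeds the cutoff implied by alpha.

    `scores` maps a pair of submission identifiers to a raw similarity score.
    The shape and scale values are assumed to come from a separate fitting step
    that is not shown here.
    """
    cutoff = weibull_cutoff(alpha, shape, scale)
    return [pair for pair, score in scores.items() if score > cutoff]
```

Under this assumption the cutoff is expressed as a tail probability rather than a fixed similarity ratio, which is one way to read the abstract's reference to statistical significance.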
Keywords :
Weibull distribution; biology computing; computer crime; genetics; program diagnostics; sequences; software tools; cutoff threshold; genomic sequence similarity; normalized similarity measure; plagiarized source code detection; Automatic programming; Bioinformatics; Distributed computing; Genomics; Information technology; Plagiarism; Size measurement
Conference_Title :
Convergence and Hybrid Information Technology, 2008. ICHIT '08. International Conference on
Conference_Location :
Daejeon
Print_ISBN :
978-0-7695-3328-5
DOI :
10.1109/ICHIT.2008.277