Title :
Token-based Code Clone Detection Technique in a Student´s Programming Exercise
Author :
Iwamoto, Mitsugu ; Oshima, S. ; Nakashima, Takayoshi
Author_Institution :
Dept. of Electron. & Inf. Eng., Kumamoto Nat. Coll. of Technol., Yatsushiro, Japan
Abstract :
The acts to submit the copied programs of other person make problems in the subject of the programming exercise in university curriculum. Teachers could not make accurate scores and evaluate the reached learning level of students. The code clone detection technique is to automatically detect the copied programs. Researches of the code clone detection technique have been proposed. The object of these researches, however, focused on the source code of industrial field. There are some problems to detect illicit copied codes of reports made by students. In this research, we developed the code clone detection algorithm focusing on the detection of illicit copied codes of submitted reports of students in a programming exercise. Our proposed algorithm is based on the comparison of tokens and can declare the illicit copied codes invalid. The features of illicit copied codes such as swapping the functions and program lines, renaming variable names, changing digits, comments and string constants and changing source codes using formatting tools are detected. We implemented the proposed algorithm and experimented to evaluate our system for the submitted subjects of 119 students. Compared to the human detection for small size of source codes of students in a programming exercise, our system found 32 codes as the illicit copy in 36 illicit copied codes among 14,042 combination detective rules with the threshold which are realized the recall=0.8. The miss detection finding as the copied code was 72 codes with precision=0.302.
Keywords :
computer science education; educational courses; copied programs; functions swapping; human detection; illicit copied codes; program line swapping; string constants; student programming exercise; token-based code clone detection technique; university curriculum; Cloning; Educational institutions; Equations; Feature extraction; Measurement; Programming; Semantics; Code Clone; Detection System; Programming Exercise; Short Source Code; Token;
Conference_Titel :
Broadband, Wireless Computing, Communication and Applications (BWCCA), 2012 Seventh International Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
978-1-4673-2972-9
DOI :
10.1109/BWCCA.2012.113