DocumentCode :
1960914
Title :
Source code similarity detection by using data mining methods
Author :
Stankov, Emil ; Jovanov, Mile ; Bogdanova, Ana Madevska
Author_Institution :
Fac. of Comput. Sci. & Eng., Univ. Ss. Cyril & Methodius, Skopje, Macedonia
fYear :
2013
fDate :
24-27 June 2013
Firstpage :
257
Lastpage :
262
Abstract :
Programming courses at university and high school level, and competitions in informatics (programming), often require fast assessment of received solutions of the programming tasks. This problem is usually solved by use of automated systems that check the produced output for some test cases for every solution. In our paper we present a novel approach of representation of the programming codes as vectors, and use of these vectors in data mining analysis that could produce better assessment of the solutions. We present the results of cluster analysis that go up to 88% of correctly clustered items on average.
Keywords :
computer science education; data mining; educational courses; educational institutions; pattern clustering; programming; source coding; vectors; automated systems; cluster analysis; data mining analysis; data mining method; high school level; programming code representation; programming courses; programming tasks; source code similarity detection; university level; vectors; Algorithm design and analysis; Data mining; Educational institutions; Informatics; Programming profession; Vectors; Programming code; clustering analysis; code similarity; evaluation of source code;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology Interfaces (ITI), Proceedings of the ITI 2013 35th International Conference on
Conference_Location :
Cavtat
ISSN :
1334-2762
Print_ISBN :
978-953-7138-30-1
Type :
conf
DOI :
10.2498/iti.2013.0576
Filename :
6649034
Link To Document :
بازگشت