Title :
Towards the Validation of Plagiarism Detection Tools by Means of Grammar Evolution
Author :
Cebrián, Manuel ; Alfonseca, Manuel ; Ortega, Alfonso
Author_Institution :
Dept. of Comput. Sci., Brown Univ., Providence, RI
fDate :
6/1/2009
Abstract :
Student plagiarism is a major problem in universities worldwide. In this paper, we focus on plagiarism in answers to computer programming assignments, where students mix and/or modify one or more original solutions to obtain counterfeits. Although several software tools have been developed to help with the tedious and time-consuming task of detecting plagiarism, little has been done to assess their quality, because determining the real authorship of the whole submission corpus is practically impossible for markers. We present a grammar evolution technique that generates benchmarks for testing plagiarism detection tools. Given a programming language, our technique generates a set of original solutions to an assignment, together with a set of plagiarisms of those solutions that mimic the basic plagiarism techniques performed by students. The authorship of the submission corpus is predefined by the user, providing a basis for the assessment and further comparison of copy-catching tools. We give empirical evidence of the suitability of our approach by studying the behavior of one advanced plagiarism detection tool (AC) on four benchmarks coded in APL2 and generated with our technique.
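As a rough illustration of the idea in the abstract, and not the authors' implementation (which targets APL2), the following Python sketch decodes integer genotypes into small programs through a toy grammar, then derives plagiarisms by mixing and mutating the genotypes of two originals while recording which originals each one was copied from. The grammar, the names `decode` and `make_benchmark`, and all parameters are hypothetical; the codon-modulo mapping is a generic grammatical-evolution-style scheme assumed here, not one taken from the paper.

```python
import random

# Hypothetical toy grammar (assumption); the paper's benchmarks are in APL2.
GRAMMAR = {
    "<expr>": [["<expr>", "<op>", "<expr>"], ["<var>"], ["<num>"]],
    "<op>": [["+"], ["-"], ["*"]],
    "<var>": [["x"], ["y"], ["z"]],
    "<num>": [["1"], ["2"], ["3"]],
}


def decode(genotype, start="<expr>", max_steps=200):
    """Leftmost derivation: each codon, modulo the rule count, picks a production."""
    pending, tokens, used = [start], [], 0
    while pending:
        symbol = pending.pop(0)
        if symbol not in GRAMMAR:          # terminal: emit it
            tokens.append(symbol)
            continue
        rules = GRAMMAR[symbol]
        if used < max_steps:
            # Codons are reused cyclically when the genotype is exhausted.
            rule = rules[genotype[used % len(genotype)] % len(rules)]
        else:
            # Budget spent: the last rule never recurses, so decoding terminates.
            rule = rules[-1]
        used += 1
        pending = list(rule) + pending
    return " ".join(tokens)


def make_benchmark(n_originals=3, n_plagiarisms=4, genome_len=12, seed=0):
    """Return (corpus, authorship); authorship is the predefined ground truth."""
    rng = random.Random(seed)
    originals = {
        f"orig{i}": [rng.randrange(256) for _ in range(genome_len)]
        for i in range(n_originals)
    }
    corpus = {name: decode(g) for name, g in originals.items()}
    authorship = {name: (name,) for name in originals}
    names = sorted(originals)
    for j in range(n_plagiarisms):
        a, b = rng.sample(names, 2)                   # two distinct sources
        cut = rng.randrange(1, genome_len)
        child = originals[a][:cut] + originals[b][cut:]       # "mix"
        child[rng.randrange(genome_len)] = rng.randrange(256)  # "modify"
        corpus[f"plag{j}"] = decode(child)
        authorship[f"plag{j}"] = (a, b)
    return corpus, authorship


if __name__ == "__main__":
    corpus, authorship = make_benchmark()
    for name in sorted(corpus):
        print(name, authorship[name], corpus[name])
```

Because `authorship` is fixed at generation time, the pairs that a detection tool flags can be compared directly against it, which is the kind of assessment the abstract describes.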
Keywords :
authoring systems; computer science education; grammars; programming languages; authorship; automatic programming; computer programming assignment; copy-catching tool; counterfeit; grammar evolution; plagiarism detection tool; programming language; software tool; student plagiarism; educational technology; genetic algorithms
Journal_Title :
Evolutionary Computation, IEEE Transactions on
DOI :
10.1109/TEVC.2008.2008797