DocumentCode :
2968368
Title :
Plagiarism Detection through Multilevel Text Comparison
Author :
Zini, Manuel ; Fabbri, Marco ; Moneglia, Massimo ; Panunzi, Alessandro
Author_Institution :
Italian Dept., Universita di Firenze
fYear :
2006
fDate :
13-15 Dec. 2006
Firstpage :
181
Lastpage :
185
Abstract :
The paper presents the implementation of a tool for plagiarism detection developed within the AXMEDIS project. The algorithm leverages the plagiarist behaviour, which is modeled as a combination of 3 basical actions: insertion, deletion, substitution. We recognize that this behaviour may occur at various level of the document structure: the plagiarist may insert, delete or substitute a word, period or a paragraph. The procedure consists in two main steps: document structure extraction and plagiarism function calculation. We propose a recursive plagiarism evaluation function to be evaluated at each level of the document structure which is based on the Levenshtein edit distance. We also propose a method that will eliminate unnecessary chunks comparison, avoiding similarity calculation of chunks which do not share enough 4-grams. We describe the similarity algorithm and discuss some implementation issues and future work
Keywords :
text analysis; AXMEDIS project; Levenshtein edit distance; document structure extraction; multilevel text comparison; plagiarism detection; plagiarism function calculation; recursive plagiarism evaluation function; Automation; Data mining; Fingerprint recognition; Large-scale systems; Multimedia databases; Plagiarism; Security; Software tools; Transform coding; Watermarking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automated Production of Cross Media Content for Multi-Channel Distribution, 2006. AXMEDIS '06. Second International Conference on
Conference_Location :
Leeds
Print_ISBN :
0-7695-2625-X
Type :
conf
DOI :
10.1109/AXMEDIS.2006.40
Filename :
4041348
Link To Document :
بازگشت