DocumentCode
1974785
Title
Winnowing-Based Similar Text Positioning Method
Author
Du Zou ; Long, WeiJiang ; Ling, Zhang
Author_Institution
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
fYear
2010
fDate
20-22 Aug. 2010
Firstpage
1
Lastpage
5
Abstract
Similar text positioning is a key step in plagiarism detection to decide the position of similar texts in the documents. A 2-step approximate merging method is proposed as follows: Heuristic approximate merging is used for error reduction in processing the text sampling fingers; Clustering methods is used to reduce the disturbance information influence on text positioning when merging the adjacent segments; Non-overlapping reverse index is used to position the similar texts. The method is applied in the homework plagiarism module of a learning platform for higher education. Taking PAN´09 public plagiarism corpus as benchmark, the principal performance indexes are better than those of reported finger-based methods and commercial software.
Keywords
further education; merging; pattern clustering; security of data; text analysis; word processing; 2-step approximate merging method; PAN´09 public plagiarism corpus; clustering method; error reduction; finger sampling; higher education; learning platform; performance index; plagiarism detection; text processing; winnowing based similar text positioning method; Computer science; Conferences; Educational institutions; Fingerprint recognition; Merging; Plagiarism; Software;
fLanguage
English
Publisher
ieee
Conference_Titel
Internet Technology and Applications, 2010 International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-5142-5
Electronic_ISBN
978-1-4244-5143-2
Type
conf
DOI
10.1109/ITAPP.2010.5566138
Filename
5566138
Link To Document