Title :
Non-relevant document reduction in anti-plagiarism using asymmetric similarity and AVL tree index
Author :
Oktoveri, Adeva ; Wibowo, Agung Toto ; Barmawi, Ari Moesriami
Author_Institution :
Inf. Dept., Telkom Univ., Bandung, Indonesia
Abstract :
Anti-plagiarism applications have been developed using various approaches. Many methods compare one document to others, regardless of their relevance. This paper proposes a method to reduce non-relevant documents (those having no similar topic with query document) by using asymmetric similarity. Whole documents are collected in one corpus. Each document is preprocessed using winnowing algorithm. The feature from winnowing is then indexed using AVL Tree algorithm to fasten document comparing process. The result shows that reducing non-relevant document shortens almost 10 times of the processing time compared to non-reduced process. Meanwhile, both processes show the same accuracy of 89.78% to give suspected documents.
Keywords :
document handling; query processing; tree data structures; AVL Tree algorithm; AVL tree index; antiplagiarism; antiplagiarism applications; asymmetric similarity; document comparing process; nonrelevant document reduction; one corpus; query document; winnowing algorithm; Accuracy; Fingerprint recognition; Indexes; Plagiarism; System analysis and design; Testing; Vegetation; AVL Tree; Information Retrieval; Longest Common Subsequence; Plagiarism Detection; Term Frequency; Winnowing;
Conference_Titel :
Intelligent and Advanced Systems (ICIAS), 2014 5th International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4799-4654-9
DOI :
10.1109/ICIAS.2014.6869547