DocumentCode :
692722
Title :
Web crawler utilization for resource search on Indonesian anti-plagiarism detection: Pemanfaatan web crawler untuk pencarian referensi pada deteksi anti-plagiarisme dokumen Bahasa Indonesia
Author :
Wibowo, Agung Toto ; Arifianto, Anditya ; Oktoveri, Adeva ; Barmawi, Ari Moesriami
Author_Institution :
Inf. Dept., Telkom Univ. Bandung, Bandung, Indonesia
fYear :
2013
fDate :
3-4 Dec. 2013
Firstpage :
117
Lastpage :
121
Abstract :
Matching a document against other documents is one of the tasks in anti-plagiarism detection. Matching can be performed both intra- and extra-corpally. This paper discusses extra-corpal matching that uses web crawlers for reference search. The role of the web crawler is described within an extra-corpal anti-plagiarism architecture. Plagiarism indications are matched using a Modified Histogram Intersection based on term n-grams. The similarity value is computed with a modified normalized histogram intersection designed for extra-corpal matching. In our experiments, the best accuracy, 94%, is obtained at threshold values of 0.4 and 0.5.
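Illustrative note (not part of the original record): the abstract names a normalized histogram intersection over term n-grams but does not state the authors' modification. The sketch below therefore shows only the plain normalized histogram intersection. The function names, the choice of n = 3, whitespace tokenization, and normalization by the suspect document's n-gram count are all assumptions made for illustration; the 0.5 default threshold is one of the two values reported in the abstract.

```python
from collections import Counter


def term_ngrams(text, n=3):
    """Build a histogram (Counter) of word-level n-grams.

    Assumption: simple lowercasing and whitespace tokenization.
    """
    terms = text.lower().split()
    return Counter(tuple(terms[i:i + n]) for i in range(len(terms) - n + 1))


def histogram_intersection(suspect, reference, n=3):
    """Plain normalized histogram intersection (not the paper's modified form).

    Sums the per-n-gram minimum counts and divides by the suspect
    document's total n-gram count (an assumed normalization).
    """
    h_suspect = term_ngrams(suspect, n)
    h_reference = term_ngrams(reference, n)
    total = sum(h_suspect.values())
    if total == 0:
        # Suspect text is shorter than n terms: nothing to compare.
        return 0.0
    overlap = sum(min(count, h_reference[gram])
                  for gram, count in h_suspect.items())
    return overlap / total


def is_flagged(suspect, reference, threshold=0.5, n=3):
    """Flag the pair when similarity reaches the threshold."""
    return histogram_intersection(suspect, reference, n) >= threshold
```

In an extra-corpal setting, `reference` would be the text of a page retrieved by the web crawler, and `suspect` the document being checked.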
Keywords :
document handling; natural language processing; pattern matching; search engines; Indonesian anti-plagiarism detection; Web crawler; document matching; extra-corpal anti-plagiarism architecture; extra-corpal matching; modified normalized histogram intersection; n-gram; reference search; resource search; similarity value; Accuracy; Crawlers; Educational institutions; Histograms; Plagiarism; Portals; Web pages; Anti-Plagiarism Architecture; Anti-plagiarism; Extra-corpal; Modified Histogram Intersection;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Intelligence and Cybernetics (CYBERNETICSCOM), 2013 IEEE International Conference on
Conference_Location :
Yogyakarta
Type :
conf
DOI :
10.1109/CyberneticsCom.2013.6865793
Filename :
6865793