DocumentCode :
3496310
Title :
Bilingual plagiarism detector
Author :
Arefin, Mohammad Shamsul ; Morimoto, Yasuhiko ; Sharif, Mohammad Amir
Author_Institution :
Grad. Sch. of Eng., Hiroshima Univ., Hiroshima, Japan
fYear :
2011
fDate :
22-24 Dec. 2011
Firstpage :
451
Lastpage :
456
Abstract :
The Internet has become the primary medium for information access and commerce in today's globalized world, and almost all information is available on the Internet, either in the user's native language or in a non-native language. It has therefore become easier to use another author's content from the Internet without proper citation or reference, and this tendency is increasing day by day. Such use of another author's content, thoughts, ideas, or expressions, and the presentation of them as one's own original work, is known as plagiarism. Although plagiarism can be found in almost every field, it is a major problem in academia, as it destroys an individual's creativity and originality and defeats the purpose of education. At present, many commercial and noncommercial plagiarism detection tools are available. However, most of them are unilingual, and none of them supports checking Bangla documents for plagiarism. In this paper, we introduce a statistical method and a method based on individual content for detecting plagiarism in English and Bangla electronic documents. The first method performs different statistical analyses of the documents for plagiarism detection, whereas the second method is based on analysis of the individual contents of the documents. The system can check a Bangla document for plagiarism against English documents and vice versa. It can also detect plagiarism among documents of the same language. The system has been evaluated on real documents. We have found that our system can efficiently detect plagiarism between documents of two different languages.
Keywords :
document handling; natural language processing; security of data; statistical analysis; Bangla electronic document; English electronic document; Internet; bilingual plagiarism detector; education purpose; individual creativity; individual originality; information access; native language; nonnative language; statistical analysis; statistical method; Databases; Generators; User interfaces; Plagiarism; documents relevancy; query execution; root detection; statistical analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Technology (ICCIT), 2011 14th International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-61284-907-2
Type :
conf
DOI :
10.1109/ICCITechn.2011.6164832
Filename :
6164832