Title :
The development of cross-language plagiarism detection tool utilising fuzzy swarm-based summarisation
Author :
Alzahrani, Salha ; Salim, Naomie ; Kent, Chow Kok ; Binwahlan, Mohammed Salem ; Suanmali, Ladda
Author_Institution :
CS & Info. Sys., Taif Univ., Taif, Saudi Arabia
fDate :
Nov. 29 2010-Dec. 1 2010
Abstract :
This work presents the design and development of a web-based system that supports cross-language similarity analysis and plagiarism detection. A suspicious document dq in a language Lq is to be submitted to the system via a PHP web-based interface. The system will accept the text through either uploading or pasting it directly to a text-area. In order to lighten large texts and provide an ideal set of queries, we introduce the idea of query document reduction via summarisation. Our proposed system utilised a fuzzy swarm-based summarisation tool originally built in Java. Then, the summary is used as a query to find similar web resources in languages Lx other than Lq via a dictionary-based translation. Thereafter, a detailed similarity analysis across the languages Lq and Lx is performed and friendly report of results is produced. Such report has global similarity score on the whole document, which assures high flexibility of utilisation.
Keywords :
Internet; language translation; natural language processing; text analysis; Java; PHP Web-based interface; Web-based system; cross-language plagiarism detection; cross-language similarity analysis; dictionary-based translation; fuzzy swarm-based summarisation; query document reduction; cross-language; fuzzy swarm-based summarisation; plagiarism detection; web-based;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-8134-7
DOI :
10.1109/ISDA.2010.5687287