DocumentCode :
3773923
Title :
Human Behavior Recognition: Semantics-Based Text Copy Detection Method
Author :
Liu Yang;Jie Xi
Author_Institution :
Binjiang Coll., Nanjing Univ. of Inf. Sci. &
fYear :
2015
Firstpage :
158
Lastpage :
162
Abstract :
Text documents are the most widely used medium on the Internet. However, they raise problems that cannot be neglected, such as plagiarism, reproduction of information content, illicit redistribution, and copyright disputes. Plagiarists have become increasingly "clever": they can rewrite content using synonym substitution, syntactic variation, and other methods. Traditional copy detection methods, which rely on exact matching or similar-string matching algorithms, do not handle semantics-based copying. To meet this challenge, this paper proposes, for the first time, a semantics-based copy detection method that supports similarity ranking. Similarity scores between the suspicious text and each text in the corpus are calculated using the proposed similarity calculation method. The top-k corpus texts with the highest similarity scores are then ranked and listed in descending order of score. Experiments on a real-world dataset show that the proposed solution is efficient and effective in supporting semantics-based copy detection.
Keywords :
"Feature extraction","Detection algorithms","Frequency modulation","Data mining","Thesauri","Plagiarism","Buildings"
Publisher :
IEEE
Conference_Title :
Computational Intelligence Theory, Systems and Applications (CCITSA), 2015 First International Conference on
Type :
conf
DOI :
10.1109/CCITSA.2015.28
Filename :
7473108