DocumentCode :
3722612
Title :
Document Copy Detection Using the Improved Fuzzy Hashing
Author :
Guohua Wu;Ershuai Fu;Liuyang Wang;Mengmeng Zhao
Author_Institution :
Sch. of Comput. Sci. &
fYear :
2015
Firstpage :
55
Lastpage :
60
Abstract :
Document copy detection is an effective way to protect intellectual property rights and to improve the efficiency of information retrieval. A common approach is to use the fingerprints of a document during detection, so selecting appropriate document fingerprints plays a key role. This paper first describes several mature methods of selecting document fingerprints and analyzes their merits and demerits. We then review the principle of Fuzzy Hashing, which suffers from unstable and inefficient fragmenting. To resolve these problems, we propose a novel algorithm based on Fuzzy Hashing. Compared with the original method, the proposed document copy detection algorithm both ensures a proper fragment size and speeds up fragmenting. In terms of efficiency and accuracy, the algorithm achieves high performance.
Keywords :
"Fingerprint recognition","Encoding","Algorithm design and analysis","Computer science","Interference","Intellectual property","Information retrieval"
Publisher :
IEEE
Conference_Title :
Computer Science and Mechanical Automation (CSMA), 2015 International Conference on
Type :
conf
DOI :
10.1109/CSMA.2015.18
Filename :
7371622