Title :
Similar Document Detection with Limited Information Disclosure
Author :
Jiang, Wei ; Murugesan, Mummoorthy ; Clifton, Chris ; Si, Luo
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN
Abstract :
Similar document detection plays important roles in many applications, such as file management, copyright protection, and plagiarism prevention. Existing protocols assume that the contents of files stored on a server (or multiple servers) are directly accessible. This assumption limits more practical applications, e.g., detecting plagiarized documents between two conferences, where submissions are confidential. We propose novel protocols to detect similar documents between two entities where documents cannot be openly shared with each other. We also conduct experiments to show the practical value of the proposed protocols.
Keywords :
document handling; copyright protection; file management; limited information disclosure; plagiarism prevention; similar document detection; Access protocols; Application software; Computer science; Copyright protection; File servers; Fingerprint recognition; Information retrieval; Monitoring; Plagiarism; Privacy;
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
DOI :
10.1109/ICDE.2008.4497482