DocumentCode
2382116
Title
A Window-Based Feature Extraction Method in Document Copy Detection
Author
Li, Xu ; Liu, Guo-Hua ; Ma, Hui-Dong
fYear
2007
fDate
1-3 Nov. 2007
Firstpage
215
Lastpage
217
Abstract
Document copy detection is an important tool to protect author´s intellectual property and to improve efficiency of digital library. It uses the extracted text features to identify copying between documents, therefore the feature extrac- tion method crucially affects the performance of a document copy detection system. This paper introduces a window- based feature extraction method and makes three contribu- tions: it can identify any matches of a certain length; it can produce the describing information where overlap occurs between documents; it can provide the results with different precision. We report the experimental result that validates the behaviors and properties of the proposed method.
Keywords
Computer science; Data mining; Data privacy; Feature extraction; Frequency; Information retrieval; Intellectual property; Protection; Software libraries; Spatial databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data, Privacy, and E-Commerce, 2007. ISDPE 2007. The First International Symposium on
Conference_Location
Chengdu
Print_ISBN
978-0-7695-3016-1
Type
conf
DOI
10.1109/ISDPE.2007.46
Filename
4402677
Link To Document