• DocumentCode
    2382116
  • Title

    A Window-Based Feature Extraction Method in Document Copy Detection

  • Author

    Li, Xu ; Liu, Guo-Hua ; Ma, Hui-Dong

  • fYear
    2007
  • fDate
    1-3 Nov. 2007
  • Firstpage
    215
  • Lastpage
    217
  • Abstract
    Document copy detection is an important tool for protecting authors' intellectual property and improving the efficiency of digital libraries. It uses extracted text features to identify copying between documents, so the feature extraction method crucially affects the performance of a document copy detection system. This paper introduces a window-based feature extraction method and makes three contributions: it can identify any match of a certain length; it can produce information describing where overlap occurs between documents; and it can provide results at different levels of precision. We report experimental results that validate the behavior and properties of the proposed method.
  • Keywords
    Computer science; Data mining; Data privacy; Feature extraction; Frequency; Information retrieval; Intellectual property; Protection; Software libraries; Spatial databases
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data, Privacy, and E-Commerce, 2007. ISDPE 2007. The First International Symposium on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-0-7695-3016-1
  • Type

    conf

  • DOI
    10.1109/ISDPE.2007.46
  • Filename
    4402677