• DocumentCode
    3206022
  • Title
    A New Document Masking Approach for Removing Confidential Information
  • Author
    Ikawa, Yohei; Kanayama, Hiroshi
  • Author_Institution
    Tokyo Res. Lab., Tokyo
  • fYear
    2007
  • fDate
    23-26 July 2007
  • Firstpage
    107
  • Lastpage
    114
  • Abstract
    Document masking techniques are becoming important for protecting confidential information, such as personal and organizational information, written as text. Such methods automatically extract and remove the names of people, places, and organizations, making documents harmless so that they can be shared safely within an organization, which contributes to improved productivity. However, existing automatic document masking techniques are not reliable enough, since they may fail to mask out-of-vocabulary proper nouns. In this paper we propose a novel technique for document masking, the Unmasking Method, in which all of the words are hidden initially and a human specifies the non-confidential words to be unmasked. The proposed method is a high-safety document masking method, since it unmasks only words that a human has manually recognized as safe. Our experimental results show its safety and effectiveness.
  • Keywords
    data privacy; feature extraction; text analysis; confidential information protection; document masking; unmasking method; Data mining; Dictionaries; Humans; Laboratories; Productivity; Protection; Safety
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    The 9th IEEE International Conference on E-Commerce Technology and the 4th IEEE International Conference on Enterprise Computing, E-Commerce, and E-Services (CEC/EEE 2007)
  • Conference_Location
    Tokyo
  • Print_ISBN
    0-7695-2913-5
  • Type
    conf
  • DOI
    10.1109/CEC-EEE.2007.8
  • Filename
    4285205
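
The idea described in the abstract, hiding every word and revealing only those a human has approved, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `unmask`, the `MASK` placeholder, the regex-based tokenization, and the case-insensitive matching are all assumptions made for illustration only.

```python
import re

MASK = "***"  # hypothetical placeholder shown in place of a hidden word


def unmask(text, safe_words):
    """Hide every word, then reveal only words in the human-approved safe list.

    Assumption: words are runs of word characters and matching is
    case-insensitive; the paper's actual tokenization is not specified here.
    """
    safe = {w.lower() for w in safe_words}

    def reveal(match):
        word = match.group(0)
        # A word stays masked unless a human has marked it as safe.
        return word if word.lower() in safe else MASK

    return re.sub(r"\w+", reveal, text)


approved = {"met", "in", "to", "discuss", "the", "budget"}
print(unmask("Alice met Bob in Tokyo to discuss the budget.", approved))
# prints: *** met *** in *** to discuss the budget.
```

Because masking is the default, a proper noun that no dictionary knows about stays hidden unless someone explicitly approves it, which is the safety property the abstract contrasts with extraction-based masking.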