• DocumentCode
    1283377
  • Title

    Identifying and Resolving Hidden Text Salting

  • Author

    Moens, Marie-Francine ; De Beer, Jan ; Boiy, Erik ; Gomez, Juan Carlos

  • Author_Institution
    Dept. of Comput. Sci., Katholieke Univ. Leuven, Heverlee, Belgium
  • Volume
    5
  • Issue
    4
  • fYear
    2010
  • Firstpage
    837
  • Lastpage
    847
  • Abstract
    Hidden salting in digital media involves the intentional addition or distortion of content patterns with the purpose of content filtering. We propose a method to detect portions of a digital text source which are invisible to the end user, when they are rendered on a visual medium (like a computer monitor). The method consists of “tapping” into the rendering process and analyzing the rendering commands to identify portions of the source text (plaintext) which will be invisible for a human reader, using criteria based on text character and background colors, font size, overlapping characters, etc. Moreover, text deemed visible (covertext) is reconstructed from rendering commands and then the character reading order is identified, which could differ from the rendering order. The detection and resolution of hidden salting is evaluated on two e-mail corpora, and the effectiveness of this method in spam filtering task is assessed. We provide a solution to a relevant open problem in content filtering applications, namely the presence of tricks aimed at circumventing automatic filters.
  • Keywords
    information filtering; media streaming; rendering (computer graphics); content filtering; digital media; e-mail; hidden text salting; rendering process; Computer displays; Computer science; Content management; Digital filters; Electronic mail; Fellows; Filtering; HTML; Humans; Image color analysis; Image reconstruction; Partitioning algorithms; Permission; Rendering (computer graphics); Content filtering; content manipulation;
  • fLanguage
    English
  • Journal_Title
    Information Forensics and Security, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1556-6013
  • Type

    jour

  • DOI
    10.1109/TIFS.2010.2063024
  • Filename
    5535168