DocumentCode
1283377
Title
Identifying and Resolving Hidden Text Salting
Author
Moens, Marie-Francine ; De Beer, Jan ; Boiy, Erik ; Gomez, Juan Carlos
Author_Institution
Dept. of Comput. Sci., Katholieke Univ. Leuven, Heverlee, Belgium
Volume
5
Issue
4
fYear
2010
Firstpage
837
Lastpage
847
Abstract
Hidden salting in digital media involves the intentional addition or distortion of content patterns with the purpose of content filtering. We propose a method to detect portions of a digital text source which are invisible to the end user, when they are rendered on a visual medium (like a computer monitor). The method consists of “tapping” into the rendering process and analyzing the rendering commands to identify portions of the source text (plaintext) which will be invisible for a human reader, using criteria based on text character and background colors, font size, overlapping characters, etc. Moreover, text deemed visible (covertext) is reconstructed from rendering commands and then the character reading order is identified, which could differ from the rendering order. The detection and resolution of hidden salting is evaluated on two e-mail corpora, and the effectiveness of this method in spam filtering task is assessed. We provide a solution to a relevant open problem in content filtering applications, namely the presence of tricks aimed at circumventing automatic filters.
Keywords
information filtering; media streaming; rendering (computer graphics); content filtering; digital media; e-mail; hidden text salting; rendering process; Computer displays; Computer science; Content management; Digital filters; Electronic mail; Fellows; Filtering; HTML; Humans; Image color analysis; Image reconstruction; Partitioning algorithms; Permission; Rendering (computer graphics); Content filtering; content manipulation;
fLanguage
English
Journal_Title
Information Forensics and Security, IEEE Transactions on
Publisher
ieee
ISSN
1556-6013
Type
jour
DOI
10.1109/TIFS.2010.2063024
Filename
5535168
Link To Document