Identifying and Resolving Hidden Text Salting

Authors: Moens Marie Francine; De Beer Jan; Boiy Erik; Carlos Gomez Juan
Source: IEEE Transactions on Information Forensics and Security, 2010, 5(4): 837-847.
DOI: 10.1109/TIFS.2010.2063024

Abstract

Hidden salting in digital media involves the intentional addition or distortion of content patterns with the purpose of evading content filtering. We propose a method to detect portions of a digital text source that are invisible to the end user when they are rendered on a visual medium (such as a computer monitor). The method consists of "tapping" into the rendering process and analyzing the rendering commands to identify portions of the source text (plaintext) that will be invisible to a human reader, using criteria based on text character and background colors, font size, overlapping characters, etc. Moreover, text deemed visible (covertext) is reconstructed from the rendering commands, and the character reading order, which may differ from the rendering order, is then identified. The detection and resolution of hidden salting is evaluated on two e-mail corpora, and the effectiveness of the method in a spam filtering task is assessed. We provide a solution to a relevant open problem in content filtering applications, namely the presence of tricks aimed at circumventing automatic filters.
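
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Glyph` record, the two thresholds, and the Euclidean RGB distance are all assumptions made for illustration, and the overlapping-character criterion is left out. Each hypothetical rendering command draws one character; characters whose color is too close to the background or whose font is too small are treated as hidden, and the remaining characters are put into reading order by position rather than by rendering order.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Glyph:
    """One hypothetical rendering command: draw `char` at (x, y)."""
    char: str
    x: float
    y: float
    size: float   # font size in points
    fg: tuple     # text color as (r, g, b)
    bg: tuple     # background color as (r, g, b)

# Illustrative thresholds, not values taken from the paper.
MIN_SIZE = 4.0
MIN_COLOR_DIST = 40.0

def color_distance(a, b):
    """Euclidean distance between two RGB triples."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) ** 0.5

def is_visible(g):
    """A glyph counts as visible if it is large enough and contrasts with its background."""
    return g.size >= MIN_SIZE and color_distance(g.fg, g.bg) >= MIN_COLOR_DIST

def visible_text(glyphs):
    """Drop hidden glyphs, then order the rest top-to-bottom, left-to-right."""
    shown = [g for g in glyphs if is_visible(g)]
    shown.sort(key=lambda g: (g.y, g.x))
    return "".join(g.char for g in shown)

WHITE, BLACK = (255, 255, 255), (0, 0, 0)

# Rendering order is scrambled and salted with two hidden characters.
commands = [
    Glyph("m", 30, 0, 12, BLACK, WHITE),
    Glyph("s", 0, 0, 12, BLACK, WHITE),
    Glyph("x", 5, 0, 12, WHITE, WHITE),   # hidden: same color as background
    Glyph("p", 10, 0, 12, BLACK, WHITE),
    Glyph("z", 15, 0, 1, BLACK, WHITE),   # hidden: font too small to read
    Glyph("a", 20, 0, 12, BLACK, WHITE),
]

print(visible_text(commands))  # spam
```

Sorting by `(y, x)` is the simplest possible reading-order heuristic and only works for a single left-to-right line layout; a filter would then be applied to the recovered string rather than to the raw source.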

  • Publication date: 2010-12