Title :
E-mail signature block analysis
Author :
Chen, Hao ; Hu, Jianying ; Sproat, Richard W.
Author_Institution :
CENPARMI, Concordia Univ., Montreal, Que., Canada
Abstract :
The signature block is a common structured component found in e-mail messages. Accurate identification and analysis of signature blocks are important in many multimedia messaging and information retrieval applications such as e-mail text-to-speech rendering. Traditional text analysis methods designed to deal with sequential text cannot handle 2D structures, while the highly unconstrained nature of signature blocks makes the application of 2D grammars very difficult. In this paper we describe an algorithm for signature block analysis which combines 2D structural segmentation with 1D grammatical constraints. The information obtained from both geometrical and linguistic analysis are integrated in a form of weighted finite state transducers, and the final solution is the optimal interpretation under both constraints
Keywords :
character recognition; computational linguistics; document image processing; electronic mail; grammars; multimedia communication; 1D grammatical constraints; 2D structural segmentation; e-mail messages; geometrical analysis; grammars; linguistic analysis; multimedia messaging; parsing; signature block; weighted finite state transducers; Algorithm design and analysis; Design methodology; Electronic mail; Information analysis; Information retrieval; Multimedia databases; Postal services; Speech synthesis; Text analysis; Transducers;
Conference_Titel :
Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-8186-8512-3
DOI :
10.1109/ICPR.1998.711900