DocumentCode
1994671
Title
Proper names extraction from fax images combining textual and image features
Author
Likforman-Sulem, Laurence ; Vaillant, Pascal ; Yvon, François
Author_Institution
ENST, Paris, France
fYear
2003
fDate
3-6 Aug. 2003
Firstpage
545
Abstract
In the frame of a unified messaging system, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object of the message, or the name of the sender. However, in the case of facsimiles, this information is not as easy to detect as in the case of e-mails, since no standard headers are defined. The aim of the presented work is to identify and extract specific information (the name of the sender) from a fax cover page. For this purpose, methods based on image document analysis (OCR recognition, physical blocks selection), and text analysis methods (optimized dictionary lookup, local grammar rules), are implemented to work in parallel. The fusion of their results brings a more accurate guess than any of the methods would achieve separately.
Keywords
dictionaries; document image processing; facsimile; feature extraction; optical character recognition; text analysis; OCR recognition; fax cover page; fax image; image document analysis; image feature extraction; local grammar rule; optical character recognition; optimized dictionary lookup; physical blocks selection; proper names extraction; text analysis; textual feature extraction; unified messaging system; Data mining; Electronic mail; Facsimile; Image analysis; Image recognition; Information retrieval; Labeling; Optical character recognition software; Optical distortion; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN
0-7695-1960-1
Type
conf
DOI
10.1109/ICDAR.2003.1227724
Filename
1227724
Link To Document