DocumentCode :
1994671
Title :
Proper names extraction from fax images combining textual and image features
Author :
Likforman-Sulem, Laurence ; Vaillant, Pascal ; Yvon, François
Author_Institution :
ENST, Paris, France
fYear :
2003
fDate :
3-6 Aug. 2003
Firstpage :
545
Abstract :
In the frame of a unified messaging system, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object of the message, or the name of the sender. However, in the case of facsimiles, this information is not as easy to detect as in the case of e-mails, since no standard headers are defined. The aim of the presented work is to identify and extract specific information (the name of the sender) from a fax cover page. For this purpose, methods based on image document analysis (OCR recognition, physical blocks selection), and text analysis methods (optimized dictionary lookup, local grammar rules), are implemented to work in parallel. The fusion of their results brings a more accurate guess than any of the methods would achieve separately.
Keywords :
dictionaries; document image processing; facsimile; feature extraction; optical character recognition; text analysis; OCR recognition; fax cover page; fax image; image document analysis; image feature extraction; local grammar rule; optical character recognition; optimized dictionary lookup; physical blocks selection; proper names extraction; text analysis; textual feature extraction; unified messaging system; Data mining; Electronic mail; Facsimile; Image analysis; Image recognition; Information retrieval; Labeling; Optical character recognition software; Optical distortion; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
Type :
conf
DOI :
10.1109/ICDAR.2003.1227724
Filename :
1227724
Link To Document :
بازگشت