• DocumentCode
    1994671
  • Title
    Proper names extraction from fax images combining textual and image features
  • Author
    Likforman-Sulem, Laurence ; Vaillant, Pascal ; Yvon, François
  • Author_Institution
    ENST, Paris, France
  • fYear
    2003
  • fDate
    3-6 Aug. 2003
  • Firstpage
    545
  • Abstract
    Within a unified messaging system, a crucial task is to provide the user with key information about every message received, such as keywords reflecting the subject of the message or the name of the sender. For facsimiles, however, this information is harder to detect than for e-mails, since no standard headers are defined. The aim of the presented work is to identify and extract specific information (the name of the sender) from a fax cover page. For this purpose, methods based on document image analysis (OCR, physical block selection) and on text analysis (optimized dictionary lookup, local grammar rules) are implemented to work in parallel. Fusing their results yields a more accurate guess than any of the methods would achieve separately.
  • Keywords
    dictionaries; document image processing; facsimile; feature extraction; optical character recognition; text analysis; OCR recognition; fax cover page; fax image; image document analysis; image feature extraction; local grammar rule; optimized dictionary lookup; physical blocks selection; proper names extraction; textual feature extraction; unified messaging system; Data mining; Electronic mail; Facsimile; Image analysis; Image recognition; Information retrieval; Labeling; Optical character recognition software; Optical distortion; Text analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-1960-1
  • Type
    conf
  • DOI
    10.1109/ICDAR.2003.1227724
  • Filename
    1227724