• DocumentCode
    2056780
  • Title

    Font recognition and contextual processing for more accurate text recognition

  • Author

    Shi, Hongwei ; Pavlidis, Theo

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York, Stony Brook, NY, USA
  • Volume
    1
  • fYear
    1997
  • fDate
    18-20 Aug 1997
  • Firstpage
    39
  • Abstract
    Font recognition and contextual processing are developed as two components that enhance the recognition accuracy of a text recognition system presented in a previous paper (H. Shi and T. Pavlidis, 1996). Font information is extracted from two sources: the global page properties and the graph matching results for recognized short words such as "a", "it", and "of". Contextual processing first composes word candidates from the recognition results and then checks each candidate against a dictionary through a spelling checker. Positional binary trigrams and word affixes are used to prune the search for word candidates.
  • Keywords
    character sets; document image processing; graph theory; search problems; spelling aids; word processing; accurate text recognition; contextual processing; dictionary; font information; font recognition; global page properties; graph matching result; positional binary trigrams; recognition accuracy; recognized short words; search pruning; spelling checker; text recognition system; word affixes; word candidates; Character recognition; Computer science; Dictionaries; Feature extraction; Image databases; Image generation; Image recognition; Optical character recognition software; Prototypes; Text recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
  • Conference_Location
    Ulm
  • Print_ISBN
    0-8186-7898-4
  • Type

    conf

  • DOI
    10.1109/ICDAR.1997.619810
  • Filename
    619810
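
The contextual processing step summarized in the abstract (composing word candidates, pruning the search with positional binary trigrams, then checking a dictionary) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names `build_trigrams` and `word_candidates` are hypothetical, the trigram table covers consecutive character positions only, the dictionary is a plain set of words standing in for a spelling checker, and word affixes are not modelled.

```python
def build_trigrams(dictionary):
    """Collect (word_length, start_position, trigram) triples seen in the dictionary.

    Membership in the returned set plays the role of the binary (present/absent)
    entry of a positional trigram table. Assumption: consecutive positions only.
    """
    table = set()
    for word in dictionary:
        for i in range(len(word) - 2):
            table.add((len(word), i, word[i:i + 3]))
    return table


def word_candidates(char_options, dictionary, table):
    """Compose word candidates from per-position character alternatives.

    char_options[i] lists the characters the recognizer proposes for position i.
    A partial candidate is dropped as soon as its latest trigram never occurs
    at that position in any dictionary word of the same length.
    """
    n = len(char_options)
    results = []

    def extend(prefix):
        if len(prefix) == n:
            # Final check against the dictionary (stand-in for a spelling checker).
            if prefix in dictionary:
                results.append(prefix)
            return
        for ch in char_options[len(prefix)]:
            cand = prefix + ch
            # Prune: the trigram ending at the new character must be allowed.
            if len(cand) >= 3 and (n, len(cand) - 3, cand[-3:]) not in table:
                continue
            extend(cand)

    extend("")
    return results


# Hypothetical toy data: a four-word dictionary and ambiguous recognizer output.
dictionary = {"font", "fort", "foot", "text"}
table = build_trigrams(dictionary)
options = [["f", "t"], ["o", "e"], ["n", "r", "x"], ["t"]]
candidates = word_candidates(options, dictionary, table)
```

In this toy run, partial strings such as "fox" or "ten" are discarded after three characters because no four-letter dictionary word contains those trigrams at position 0, so only a few full candidates ever reach the dictionary lookup.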