• DocumentCode
    2631497
  • Title

    Joint normalization and recognition of degraded document images using psuedo-2D hidden Markov models

  • Author

    Agazzi, O.E. ; Kuo, S.

  • Author_Institution
    AT&T Bell Lab., Murray Hill, NJ, USA
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    155
  • Lastpage
    158
  • Abstract
    The authors introduce a method to render optical character recognition algorithms based on pseudo two-dimensional hidden Markov models (PHMMs) independent of image transformations such as scaling, translations, slant, vertical and/or horizontal stretching, etc. Estimation of transformation parameters and image normalization are performed simultaneously with recognition. When combined with a previous method for joint segmentation and recognition of connected and degraded text, this method can be used to recognize extremely degraded documents that include characters affected by various geometric transformations. Experiments with isolated characters where scaling, slant angle, and translation are varied over ranges of 4: 1, 0° to 45°, and 0 to 40 pixels respectively, are presented. Also presented are experiments with connected text where images have been affected both by geometric and stochastic distortions of various degrees, that show the high effectiveness of this technique
  • Keywords
    document handling; document image processing; hidden Markov models; image segmentation; optical character recognition; PHMMs; connected text; degraded document images; degraded text; geometric transformations; image normalization; image transformations; optical character recognition algorithms; pseudo two-dimensional hidden Markov models; psuedo-2D hidden Markov models; segmentation; stochastic distortions; Character recognition; Degradation; Hidden Markov models; Image recognition; Image segmentation; Optical character recognition software; Optical distortion; Parameter estimation; Rendering (computer graphics); Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395760
  • Filename
    395760