• DocumentCode
    3236550
  • Title

    Dynamic planar warping for optical character recognition

  • Author

    Levin, Esther ; Pieraccini, Roberto

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    3
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    149
  • Abstract
    The authors extend the dynamic time warping (DTW) algorithm, widely used in automatic speech recognition (ASR), to a dynamic plane warping (DPW) algorithm, for application in the field of optical character recognition (OCR) or similar applications. Although direct application of the optimality principle reduced the computational complexity somewhat, the DPW (or image alignment) problem is exponential in the dimensions of the image. It is shown that by applying constraints to the image alignment problem, e.g., limiting the class of possible distortions, one can reduce the computational complexity dramatically, and find the optimal solution to the constrained problem in linear time. A statistical model, the planar hidden Markov model (PHMM), describing statistical properties of images is proposed. The PHMM approach was evaluated using a set of isolated handwritten digits. An overall digit recognition accuracy of 95% was achieved. It is expected that the advantage of this approach will be even more significant for harder tasks, such cursive-writing recognition and spotting
  • Keywords
    hidden Markov models; optical character recognition; DTW; OCR; PHMM; automatic speech recognition; computational complexity; constrained problem; digit recognition accuracy; distortions; dynamic plane warping; dynamic time warping; handwritten digits; image alignment; linear time; optical character recognition; optimality principle; planar hidden Markov model; statistical model; statistical properties; Automatic speech recognition; Character recognition; Computational complexity; Dynamic programming; Handwriting recognition; Hidden Markov models; Lattices; Optical character recognition software; Optical distortion; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.226254
  • Filename
    226254