• DocumentCode
    2031097
  • Title

    Machine vision for keyword spotting using pseudo 2D hidden Markov models

  • Author

    Kuo, Shyh-shiaw ; Agazzi, Oscar E.

  • Author_Institution
    AT&T Bell Lab., Murray Hill, NJ, USA
  • Volume
    5
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    81
  • Abstract
    An algorithm for robust machine recognition of keywords embedded in a poorly printed document is presented. For each keyword, two statistical models, called pseudo-2D hidden Markov models (P2-DHMMs), are created for representing the actual keyword and all the other extraneous words, respectively. Dynamic programming is then used for matching an unknown input word with the two models and making a maximum likelihood decision. Although the models are pseudo 2-D in the sense that they are not fully connected 2-D networks, they are shown to be general enough to characterize printed words efficiently. These models facilitate a nice ´elastic matching´ property in both horizontal and vertical directions, which makes the recognizer not only independent of size and slant but also tolerant of highly deformed and noisy words. The system is evaluated on a synthetically created database which contains about 26000 words. A recognition accuracy of 99% is achieved when words in testing and training sets are in the same font size. An accuracy of 96% is achieved when they are in different sizes. In the latter case, the conventional 1-D HMM approach achieves only 70% accuracy rate.<>
  • Keywords
    computer vision; document image processing; dynamic programming; hidden Markov models; optical character recognition; dynamic programming; elastic matching; keyword spotting; machine vision; maximum likelihood decision; printed words; pseudo-2D hidden Markov models; recognition accuracy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319752
  • Filename
    319752