• DocumentCode
    2839381
  • Title

    Design of a linguistic postprocessor using variable memory length Markov models

  • Author

    Guyon, Isabelle ; Pereira, Fernando

  • Author_Institution
    AT&T Bell Labs., USA
  • Volume
    1
  • fYear
    1995
  • fDate
    14-16 Aug 1995
  • Firstpage
    454
  • Abstract
    We describe a linguistic postprocessor for character recognizers. The central module of our system is a trainable variable memory length Markov model (VLMM) that predicts the next character given a variable length window of past characters. The overall system is composed of several finite state automata, including the main VLMM and a proper noun VLMM. The best model reported in the literature (Brown et al., 1992) achieves 1.75 bits per character on the Brown corpus. On that same corpus, our model, trained on 10 times less data, reaches 2.19 bits per character and is 200 times smaller (≃160,000 parameters). The model was designed for handwriting recognition applications but could also be used for other OCR problems and speech recognition
  • Keywords
    Markov processes; computational linguistics; finite automata; handwriting recognition; Markov models; OCR; VLMM; character recognizers; finite state automata; handwriting recognition; linguistic postprocessor; variable memory length; variable memory length Markov model; Automata; Character recognition; Feedback; Handwriting recognition; Humans; Optical character recognition software; Predictive models; Speech recognition; Vocabulary; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    0-8186-7128-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.1995.599034
  • Filename
    599034