• DocumentCode
    1994338
  • Title

    Optimizing the number of states, training iterations and Gaussians in an HMM-based handwritten word recognizer

  • Author

    Günter, Simon ; Bunke, Horst

  • Author_Institution
    Comput. Sci. Dept., Bern Univ., Switzerland
  • fYear
    2003
  • fDate
    3-6 Aug. 2003
  • Firstpage
    472
  • Abstract
    In off-line handwriting recognition, classifiers based on hidden Markov models (HMMs) have become very popular. However, while there exist well-established training algorithms, such as the Baum-Welsh procedure, which optimize the transition and output probabilities of a given HMM architecture, the architecture itself, and in particular the number of states, must be chosen "by hand". Also the number of training iterations and the output distributions need to be defined by the system designer. In this paper we examine some optimization strategies for an HMM classifier that works with continuous feature values and uses the Baum-Welch training algorithm. The free parameters of the optimization procedure introduced in this paper are the number of states of a model, the number of training iterations, and the number of Gaussian mixtures for each state. The proposed optimization strategies are evaluated in the context of a handwritten word recognition task.
  • Keywords
    Gaussian distribution; feature extraction; handwriting recognition; handwritten character recognition; hidden Markov models; optical character recognition; Baum-Welsh procedure; Gaussian mixtures; HMM architecture; HMM classifier; HMM-based handwritten word recognizer; continuous feature values; hidden Markov models; off-line handwriting recognition; optimization strategies; output distributions; state number optimization; training algorithms; training iteration optimization; transition optimization; Character recognition; Computer science; Electronic mail; Gaussian distribution; Gaussian processes; Handwriting recognition; Hidden Markov models; Maximum likelihood estimation; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-1960-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2003.1227710
  • Filename
    1227710