DocumentCode :
1994338
Title :
Optimizing the number of states, training iterations and Gaussians in an HMM-based handwritten word recognizer
Author :
Günter, Simon ; Bunke, Horst
Author_Institution :
Comput. Sci. Dept., Bern Univ., Switzerland
fYear :
2003
fDate :
3-6 Aug. 2003
Firstpage :
472
Abstract :
In off-line handwriting recognition, classifiers based on hidden Markov models (HMMs) have become very popular. However, while there exist well-established training algorithms, such as the Baum-Welsh procedure, which optimize the transition and output probabilities of a given HMM architecture, the architecture itself, and in particular the number of states, must be chosen "by hand". Also the number of training iterations and the output distributions need to be defined by the system designer. In this paper we examine some optimization strategies for an HMM classifier that works with continuous feature values and uses the Baum-Welch training algorithm. The free parameters of the optimization procedure introduced in this paper are the number of states of a model, the number of training iterations, and the number of Gaussian mixtures for each state. The proposed optimization strategies are evaluated in the context of a handwritten word recognition task.
Keywords :
Gaussian distribution; feature extraction; handwriting recognition; handwritten character recognition; hidden Markov models; optical character recognition; Baum-Welsh procedure; Gaussian mixtures; HMM architecture; HMM classifier; HMM-based handwritten word recognizer; continuous feature values; hidden Markov models; off-line handwriting recognition; optimization strategies; output distributions; state number optimization; training algorithms; training iteration optimization; transition optimization; Character recognition; Computer science; Electronic mail; Gaussian distribution; Gaussian processes; Handwriting recognition; Hidden Markov models; Maximum likelihood estimation; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
Type :
conf
DOI :
10.1109/ICDAR.2003.1227710
Filename :
1227710
Link To Document :
بازگشت