DocumentCode :
542330
Title :
On application of the Faragó-Lugosi algorithm in speech recognition
Author :
Roberts, William J.J. ; Ephraim, Yariv
Author_Institution :
Atlantic Coast Technologies Inc., 11499 Columbia Pike, Silver Spring, MD, 20904, USA
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
In 1989 a new algorithm for non-iterative global maximization of the joint likelihood function of state and observation sequences of left-right HMM´s was developed by Faragó and Lugosi. The algorithm capitalizes on the fact that the state sequence of a left-right HMM is uniquely determined by the state duration occupancies, The algorithm is mostly suitable for parameter estimation from a single training sequence. Extensions to estimation from multiple sequences are possible but deemed impractical. Two alternatives are proposed for utilizing this algorithm in automatic speech recognition. The first is based on averaging the parameter estimates from individual sequences while the second uses the Faragó and Lugosi segmentation to initialize the segmental k-means or the Baum algorithm. We have implemented the algorithm and tested it in isolated digit recognition. Using the second approach, a reduction of the error rate from .65% to .36% was realized
Keywords :
Artificial neural networks; Estimation; Hidden Markov models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743948
Filename :
5743948
Link To Document :
بازگشت