DocumentCode :
2286848
Title :
Corrective tuning by applying LVQ for continuous density and semi-continuous Markov models
Author :
Kurimo, Mikko
Author_Institution :
Lab. of Comput. & Inf. Sci., Helsinki Univ. of Technol., Espoo, Finland
fYear :
1994
fDate :
13-16 Apr 1994
Firstpage :
718
Abstract :
In this work the objective is to increase the accuracy of speaker dependent phonetic transcription of spoken utterances using continuous density and semi-continuous HMMs. Experiments with LVQ based corrective tuning indicate that the average recognition error rate can be made to decrease about 5%-10%. Experiments are also made to increase the efficiency of the Viterbi decoding by a discriminative approximation of the output probabilities of the states in the Markov models. Using only a few nearest components of the mixture density functions instead of every component decreases both the recognition error rate (5%-10% for CDHMMs) and the execution time (about 50% for SCHMMs). The lowest average error rates achieved were about 5.6%
Keywords :
decoding; hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; speech recognition; vector quantisation; HMMs; LVQ; Viterbi decoding; average recognition error rate; continuous density Markov models; corrective tuning; execution time; learning vector quantisation; mixture density functions; output probabilities; recognition error rate; semi-continuous Markov models; speaker dependent phonetic transcription; spoken utterances; Density functional theory; Error analysis; Hidden Markov models; Image processing; Labeling; Laboratories; Neural networks; Speech processing; Viterbi algorithm; Voting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
Print_ISBN :
0-7803-1865-X
Type :
conf
DOI :
10.1109/SIPNN.1994.344811
Filename :
344811
Link To Document :
بازگشت