DocumentCode :
2286943
Title :
HMMs with mixtures of trend functions for automatic speech recognition
Author :
Deng, L. ; Aksmanovic, M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
fYear :
1994
fDate :
13-16 Apr 1994
Firstpage :
702
Abstract :
In this study we extend the nonstationary-state (trended) HMM from the single-trend formulation of Deng (see Signal Processing, vo1.27, no.1, p. 65-78, 1992) to the mixture-trend one. This extension is motivated by the observation of wide variations in the trajectories of the acoustic data in fluent, speaker-independent speech associated with a given underlying linguistic unit. We show how HMMs with mixtures of trend functions can be implemented simply in the already well established singly trended HMM framework via the device of expanding each state into a set of parallel states. Details of a maximum-likelihood based algorithm are given for estimating state-dependent mixture trajectory parameters in the model. Experimental results on the task of classifying speaker-independent vowels excised from TIMIT database demonstrate consistent performance improvement using phonemic mixture-trended HMMs over their singly-trended counterpart
Keywords :
acoustic signal processing; hidden Markov models; maximum likelihood estimation; parameter estimation; speech analysis and processing; speech recognition; HMM; TIMIT database; acoustic data; automatic speech recognition; experimental results; linguistic unit; maximum-likelihood based algorithm; mixture-trend formulation; nonstationary-state; parallel states; parameter estimation; speaker-independent speech; speaker-independent vowels classification; state-dependent mixture trajectory parameters; trend functions; Acoustic devices; Automatic speech recognition; Gaussian processes; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Polynomials; Speech recognition; Stochastic processes; Underwater acoustics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
Print_ISBN :
0-7803-1865-X
Type :
conf
DOI :
10.1109/SIPNN.1994.344815
Filename :
344815
Link To Document :
بازگشت