Title :
HMMs with mixtures of trend functions for automatic speech recognition
Author :
Deng, L. ; Aksmanovic, M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Abstract :
In this study we extend the nonstationary-state (trended) HMM from the single-trend formulation of Deng (see Signal Processing, vo1.27, no.1, p. 65-78, 1992) to the mixture-trend one. This extension is motivated by the observation of wide variations in the trajectories of the acoustic data in fluent, speaker-independent speech associated with a given underlying linguistic unit. We show how HMMs with mixtures of trend functions can be implemented simply in the already well established singly trended HMM framework via the device of expanding each state into a set of parallel states. Details of a maximum-likelihood based algorithm are given for estimating state-dependent mixture trajectory parameters in the model. Experimental results on the task of classifying speaker-independent vowels excised from TIMIT database demonstrate consistent performance improvement using phonemic mixture-trended HMMs over their singly-trended counterpart
Keywords :
acoustic signal processing; hidden Markov models; maximum likelihood estimation; parameter estimation; speech analysis and processing; speech recognition; HMM; TIMIT database; acoustic data; automatic speech recognition; experimental results; linguistic unit; maximum-likelihood based algorithm; mixture-trend formulation; nonstationary-state; parallel states; parameter estimation; speaker-independent speech; speaker-independent vowels classification; state-dependent mixture trajectory parameters; trend functions; Acoustic devices; Automatic speech recognition; Gaussian processes; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Polynomials; Speech recognition; Stochastic processes; Underwater acoustics;
Conference_Titel :
Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
Print_ISBN :
0-7803-1865-X
DOI :
10.1109/SIPNN.1994.344815