مرکز منطقه ای اطلاع رساني علوم و فناوري - HMMs with mixtures of trend functions for automatic speech recognition

DocumentCode :

2286943

Title :

HMMs with mixtures of trend functions for automatic speech recognition

Author :

Deng, L. ; Aksmanovic, M.

Author_Institution :

Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada

fYear :

1994

fDate :

13-16 Apr 1994

Firstpage :

702

Abstract :

In this study we extend the nonstationary-state (trended) HMM from the single-trend formulation of Deng (see Signal Processing, vo1.27, no.1, p. 65-78, 1992) to the mixture-trend one. This extension is motivated by the observation of wide variations in the trajectories of the acoustic data in fluent, speaker-independent speech associated with a given underlying linguistic unit. We show how HMMs with mixtures of trend functions can be implemented simply in the already well established singly trended HMM framework via the device of expanding each state into a set of parallel states. Details of a maximum-likelihood based algorithm are given for estimating state-dependent mixture trajectory parameters in the model. Experimental results on the task of classifying speaker-independent vowels excised from TIMIT database demonstrate consistent performance improvement using phonemic mixture-trended HMMs over their singly-trended counterpart

Keywords :

acoustic signal processing; hidden Markov models; maximum likelihood estimation; parameter estimation; speech analysis and processing; speech recognition; HMM; TIMIT database; acoustic data; automatic speech recognition; experimental results; linguistic unit; maximum-likelihood based algorithm; mixture-trend formulation; nonstationary-state; parallel states; parameter estimation; speaker-independent speech; speaker-independent vowels classification; state-dependent mixture trajectory parameters; trend functions; Acoustic devices; Automatic speech recognition; Gaussian processes; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Polynomials; Speech recognition; Stochastic processes; Underwater acoustics;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on

Print_ISBN :

0-7803-1865-X

Type :

conf

DOI :

10.1109/SIPNN.1994.344815

Filename :

344815

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2286943