• DocumentCode
    2286943
  • Title

    HMMs with mixtures of trend functions for automatic speech recognition

  • Author

    Deng, L. ; Aksmanovic, M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
  • fYear
    1994
  • fDate
    13-16 Apr 1994
  • Firstpage
    702
  • Abstract
    In this study we extend the nonstationary-state (trended) HMM from the single-trend formulation of Deng (see Signal Processing, vo1.27, no.1, p. 65-78, 1992) to the mixture-trend one. This extension is motivated by the observation of wide variations in the trajectories of the acoustic data in fluent, speaker-independent speech associated with a given underlying linguistic unit. We show how HMMs with mixtures of trend functions can be implemented simply in the already well established singly trended HMM framework via the device of expanding each state into a set of parallel states. Details of a maximum-likelihood based algorithm are given for estimating state-dependent mixture trajectory parameters in the model. Experimental results on the task of classifying speaker-independent vowels excised from TIMIT database demonstrate consistent performance improvement using phonemic mixture-trended HMMs over their singly-trended counterpart
  • Keywords
    acoustic signal processing; hidden Markov models; maximum likelihood estimation; parameter estimation; speech analysis and processing; speech recognition; HMM; TIMIT database; acoustic data; automatic speech recognition; experimental results; linguistic unit; maximum-likelihood based algorithm; mixture-trend formulation; nonstationary-state; parallel states; parameter estimation; speaker-independent speech; speaker-independent vowels classification; state-dependent mixture trajectory parameters; trend functions; Acoustic devices; Automatic speech recognition; Gaussian processes; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Polynomials; Speech recognition; Stochastic processes; Underwater acoustics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
  • Print_ISBN
    0-7803-1865-X
  • Type

    conf

  • DOI
    10.1109/SIPNN.1994.344815
  • Filename
    344815