• DocumentCode
    1184251
  • Title

    Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states

  • Author

    Deng, Li ; Aksmanovic, Mike ; Sun, Xiaodong ; Wu, C. F Jeff

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
  • Volume
    2
  • Issue
    4
  • fYear
    1994
  • fDate
    10/1/1994 12:00:00 AM
  • Firstpage
    507
  • Lastpage
    520
  • Abstract
    Proposes, implements, and evaluates a class of nonstationary-state hidden Markov models (HMMs) having each state associated with a distinct polynomial regression function of time plus white Gaussian noise. The model represents the transitional acoustic trajectories of speech in a parametric manner, and includes the standard stationary-state HMM as a special, degenerated case. The authors develop an efficient dynamic programming technique which includes the state sojourn time as an optimization variable, in conjunction with a state-dependent orthogonal polynomial regression method, for estimating the model parameters. Experiments on fitting models to speech data and on limited-vocabulary speech recognition demonstrate consistent superiority of these nonstationary-state HMMs over the traditional stationary-state HMMs
  • Keywords
    dynamic programming; hidden Markov models; maximum likelihood estimation; parameter estimation; polynomials; random noise; speech recognition; statistical analysis; white noise; dynamic programming; fitting models; hidden Markov models; limited-vocabulary speech recognition; nonstationary states; optimization; polynomial regression functions; speech data; standard stationary-state HMM; state sojourn time; state-dependent orthogonal polynomial regression method; time; transitional acoustic trajectories; white Gaussian noise; Covariance matrix; Gaussian noise; Hidden Markov models; Optimization methods; Parameter estimation; Polynomials; Speech recognition; State estimation; Stochastic processes; Sun;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.326610
  • Filename
    326610