Title :
A stochastic model of speech incorporating hierarchical nonstationarity
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
fDate :
10/1/1993 12:00:00 AM
Abstract :
The concept of two-level (global and local) hierarchical nonstationarity is introduced to describe the elastic and dynamic nature of the speech signal. A doubly stochastic process model is developed to implement this concept. In the model, the global nonstationarity is embodied through an underlying Markov chain that governs evolution of the parameters in a set of output stochastic processes. The local nonstationarity is realized by utilizing state-conditioned, time-varying first- and second-order statistics in the output data-generation process models. For potential uses in automatic uncovering of relationally invariant properties from the speech signal and in speech recognition, the local nonstationarity is represented in a parametric form. Preliminary experiments on fitting the models to speech data demonstrate superior performances of the proposed model to several traditional types of hidden Markov models
Keywords :
parameter estimation; speech analysis and processing; speech recognition; stochastic processes; Markov chain; automatic uncovering; doubly stochastic process model; dynamic nature; elastic nature; global nonstationarity; hierarchical nonstationarity; local nonstationarity; output data-generation process models; output stochastic processes; relationally invariant properties; speech recognition; speech signal; state-conditioned time-varying statistics; stochastic model; Equations; Filters; Hidden Markov models; Mathematics; Natural languages; Parametric statistics; Speech recognition; Stochastic processes;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on