Title :
Modeling speech variability with segmental HMMs
Author :
Holmes, Wendy J. ; Russell, Martin J.
Author_Institution :
Speech Res. Unit, DRA Malvern, UK
Abstract :
An important characteristic of segmental hidden Markov models (HMMs) is a distinction between extra-segmental variability across different examples of a sub-phonemic speech segment and intra-segmental variability within any one example. This paper considers the modeling of these two types of variability in some detail. Analysis of natural speech data has suggested that intra-segmental variability of mel cepstrum features is not well-approximated by single Gaussians. The model of this variability was improved by introducing a two-component Gaussian mixture, where the two components have the same mean but one has much smaller variance than the other. Initial recognition experiments using models with three segments per phone have shown the importance of accurately modeling intra-segmental variability, in order for the intra- and extra-segmental probabilities to balance correctly and hence for good recognition performance to be achieved
Keywords :
Gaussian processes; cepstral analysis; hidden Markov models; speech recognition; extra-segmental variability; intra-segmental variability; mel cepstrum features; natural speech data; segmental HMM; segmental hidden Markov models; speech recognition experiments; speech variability; sub-phonemic speech segment; two-component Gaussian mixture; variance; Cepstral analysis; Cepstrum; Difference equations; Differential equations; Gaussian processes; Hidden Markov models; Natural languages; Speech analysis; Stochastic processes; Vectors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.541129