Title :
Stressed speech recognition using multi-dimensional hidden Markov models
Author :
Womack, Brian D. ; Hansen, John H L
Abstract :
Robust speech recognition systems must address variations due to perceptually induced stress in order to maintain acceptable levels of performance in adverse conditions. This study proposes a new approach which combines stress classification and speech recognition into one algorithm. This is accomplished by generalizing the one-dimensional hidden Markov model to a multi-dimensional hidden Markov model (N-D HMM) where each stressed speech style is allocated a dimension in the N-D HMM. It is shown that this formulation better integrates perceptually induced stress effects for stress independent recognition. This is due to the sub-phoneme (state level) stress classification that is implicitly performed by the algorithm. The proposed N-D HMM method is compared to neutral and multi-styled stress trained 1-D HMM recognizers. Average recognition rates are shown to improve by +15.72% over the 1-D stress dependent recognizer and 26.67% over the 1-D neutral trained recognizer
Keywords :
hidden Markov models; performance evaluation; speech recognition; 1D neutral trained recognizer; 1D stress dependent recognizer; multidimensional hidden Markov models; multistyled stress; neutral stress; one-dimensional hidden Markov model; perceptually induced stress; performance; stress classification; stress independent recognition; stressed speech recognition; subphoneme stress classification; Databases; Degradation; Environmental factors; Hidden Markov models; Neural networks; Robustness; Speech analysis; Speech processing; Speech recognition; Stress;
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
DOI :
10.1109/ASRU.1997.659117