Title :
Source generator equalization and enhancement of spectral properties for robust speech recognition in noise and stress
Author :
Hansen, John H L ; Clements, Mark A.
Author_Institution :
Dept. of Electr. Eng., Duke Univ., Durham, NC, USA
fDate :
9/1/1995 12:00:00 AM
Abstract :
Studies have shown that depending on speaker task and environmental conditions, recognizers are sensitive to noisy stressful environments. The focus of the study is to achieve robust recognition in diverse environmental conditions through the formulation of feature enhancement and stress equalization algorithms under the framework of source generator theory. The generator framework is considered as a means of modeling production variation under stressful speaking conditions. A multi-dimensional stress equalization procedure is formulated that produces recognition features less sensitive to varying factors caused by stress. A feature enhancement algorithm is employed based on iterative techniques previously derived for enhancement of speech in varying background noise environments. Combined stress equalization and feature enhancement reduces average word error rates across 10 noisy stressful conditions by -38.7% (e.g., noisy loud, angry, and Lombard effect stress conditions, etc.). The results suggest that the combination of a flexible source generator framework to address stressed speaking conditions, and a feature enhancement algorithm that adapts based on speech-specific constraints, can be effective in reducing the consequences of stress and noise for robust automatic recognition
Keywords :
equalisers; error statistics; feature extraction; iterative methods; speech enhancement; speech recognition; enhancement; feature enhancement; iterative techniques; noisy stressful environments; production variation; robust automatic recognition; robust speech recognition; source generator equalization; spectral properties; stress equalization algorithms; word error rates; Automatic speech recognition; Background noise; Error analysis; Iterative algorithms; Noise generators; Noise reduction; Noise robustness; Speech enhancement; Stress; Working environment noise;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on