Source generator equalization and enhancement of spectral properties for robust speech recognition in noise and stress

Author

Hansen, John H. L.; Clements, Mark A.

Author_Institution

Dept. of Electr. Eng., Duke Univ., Durham, NC, USA

Volume

3

Issue

5

fYear

1995

fDate

9/1/1995 12:00:00 AM

Firstpage

407

Lastpage

415

Abstract

Studies have shown that, depending on speaker task and environmental conditions, recognizers are sensitive to noisy, stressful environments. The focus of this study is to achieve robust recognition in diverse environmental conditions by formulating feature enhancement and stress equalization algorithms within the framework of source generator theory. The generator framework is considered as a means of modeling production variation under stressful speaking conditions. A multi-dimensional stress equalization procedure is formulated that produces recognition features less sensitive to the varying factors caused by stress. A feature enhancement algorithm is employed, based on iterative techniques previously derived for enhancement of speech in varying background noise environments. Combined stress equalization and feature enhancement reduces average word error rates by 38.7% across 10 noisy stressful conditions (e.g., noisy loud, angry, and Lombard effect stress conditions). The results suggest that the combination of a flexible source generator framework to address stressed speaking conditions and a feature enhancement algorithm that adapts based on speech-specific constraints can be effective in reducing the consequences of stress and noise for robust automatic recognition.

Keywords

equalisers; error statistics; feature extraction; iterative methods; speech enhancement; speech recognition; enhancement; feature enhancement; iterative techniques; noisy stressful environments; production variation; robust automatic recognition; robust speech recognition; source generator equalization; spectral properties; stress equalization algorithms; word error rates; Automatic speech recognition; Background noise; Error analysis; Iterative algorithms; Noise generators; Noise reduction; Noise robustness; Speech enhancement; Stress; Working environment noise

fLanguage

English

Journal_Title

Speech and Audio Processing, IEEE Transactions on

Publisher

IEEE

ISSN

1063-6676

Type

jour

DOI

10.1109/89.466655

Filename

466655