DocumentCode :
3350503
Title :
Classification of speech under stress based on features derived from the nonlinear Teager energy operator
Author :
Zhou, Guojun ; Hansen, John H L ; Kaiser, James F.
Author_Institution :
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
Volume :
1
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
549
Abstract :
Studies have shown that the distortion introduced by stress or emotion can severely reduce speech recognition accuracy. Techniques for detecting or assessing the presence of stress could help neutralize stressed speech and improve the robustness of speech recognition systems. Although some acoustic variables derived from linear speech production theory have been investigated as indicators of stress, they are not consistent. Three new features derived from the nonlinear Teager (1990) energy operator (TEO) are investigated for stress assessment and classification. It is believed that TEO based features are better able to reflect the nonlinear airflow structure of speech production under adverse stressful conditions. The proposed features outperform stress classification using traditional pitch by +22.5% for the normalized TEO autocorrelation envelope area feature (TEO-Auto-Env), and by +28.8% for TEO based pitch feature (TEO-Pitch). Overall neutral/stress classification rates are more consistent for TEO based features (TEO-Auto-Env: σ=5.15, TEO-pitch: σ=7.83) vs. (pitch: σ=23.40). Also, evaluation results using actual emergency aircraft cockpit stressed speech from NATO show that TEO-Auto-Env works best for stress assessment
Keywords :
acoustic signal processing; correlation methods; feature extraction; mathematical operators; pattern classification; speech recognition; NATO; acoustic variables; emergency aircraft cockpit stressed speech; emotion; linear speech production theory; neutral/stress classification rates; nonlinear Teager energy operator; nonlinear airflow structure; normalized TEO autocorrelation envelope area feature; pitch feature; speech classification; speech features; speech production; speech recognition accuracy; speech recognition systems; stress assessment; stress classification; stressed speech; Autocorrelation; Background noise; Human factors; Laboratories; Loudspeakers; Robustness; Speech analysis; Speech processing; Speech recognition; Stress;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.674489
Filename :
674489
Link To Document :
بازگشت