DocumentCode
3350503
Title
Classification of speech under stress based on features derived from the nonlinear Teager energy operator
Author
Zhou, Guojun ; Hansen, John H L ; Kaiser, James F.
Author_Institution
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
Volume
1
fYear
1998
fDate
12-15 May 1998
Firstpage
549
Abstract
Studies have shown that the distortion introduced by stress or emotion can severely reduce speech recognition accuracy. Techniques for detecting or assessing the presence of stress could help neutralize stressed speech and improve the robustness of speech recognition systems. Although some acoustic variables derived from linear speech production theory have been investigated as indicators of stress, they are not consistent. Three new features derived from the nonlinear Teager (1990) energy operator (TEO) are investigated for stress assessment and classification. It is believed that TEO based features are better able to reflect the nonlinear airflow structure of speech production under adverse stressful conditions. The proposed features outperform stress classification using traditional pitch by +22.5% for the normalized TEO autocorrelation envelope area feature (TEO-Auto-Env), and by +28.8% for TEO based pitch feature (TEO-Pitch). Overall neutral/stress classification rates are more consistent for TEO based features (TEO-Auto-Env: σ=5.15, TEO-pitch: σ=7.83) vs. (pitch: σ=23.40). Also, evaluation results using actual emergency aircraft cockpit stressed speech from NATO show that TEO-Auto-Env works best for stress assessment
Keywords
acoustic signal processing; correlation methods; feature extraction; mathematical operators; pattern classification; speech recognition; NATO; acoustic variables; emergency aircraft cockpit stressed speech; emotion; linear speech production theory; neutral/stress classification rates; nonlinear Teager energy operator; nonlinear airflow structure; normalized TEO autocorrelation envelope area feature; pitch feature; speech classification; speech features; speech production; speech recognition accuracy; speech recognition systems; stress assessment; stress classification; stressed speech; Autocorrelation; Background noise; Human factors; Laboratories; Loudspeakers; Robustness; Speech analysis; Speech processing; Speech recognition; Stress;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.674489
Filename
674489
Link To Document