• DocumentCode
    3350503
  • Title

    Classification of speech under stress based on features derived from the nonlinear Teager energy operator

  • Author

    Zhou, Guojun ; Hansen, John H L ; Kaiser, James F.

  • Author_Institution
    Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    549
  • Abstract
    Studies have shown that the distortion introduced by stress or emotion can severely reduce speech recognition accuracy. Techniques for detecting or assessing the presence of stress could help neutralize stressed speech and improve the robustness of speech recognition systems. Although some acoustic variables derived from linear speech production theory have been investigated as indicators of stress, they are not consistent. Three new features derived from the nonlinear Teager (1990) energy operator (TEO) are investigated for stress assessment and classification. It is believed that TEO based features are better able to reflect the nonlinear airflow structure of speech production under adverse stressful conditions. The proposed features outperform stress classification using traditional pitch by +22.5% for the normalized TEO autocorrelation envelope area feature (TEO-Auto-Env), and by +28.8% for TEO based pitch feature (TEO-Pitch). Overall neutral/stress classification rates are more consistent for TEO based features (TEO-Auto-Env: σ=5.15, TEO-pitch: σ=7.83) vs. (pitch: σ=23.40). Also, evaluation results using actual emergency aircraft cockpit stressed speech from NATO show that TEO-Auto-Env works best for stress assessment
  • Keywords
    acoustic signal processing; correlation methods; feature extraction; mathematical operators; pattern classification; speech recognition; NATO; acoustic variables; emergency aircraft cockpit stressed speech; emotion; linear speech production theory; neutral/stress classification rates; nonlinear Teager energy operator; nonlinear airflow structure; normalized TEO autocorrelation envelope area feature; pitch feature; speech classification; speech features; speech production; speech recognition accuracy; speech recognition systems; stress assessment; stress classification; stressed speech; Autocorrelation; Background noise; Human factors; Laboratories; Loudspeakers; Robustness; Speech analysis; Speech processing; Speech recognition; Stress;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674489
  • Filename
    674489