• DocumentCode
    2899257
  • Title

    Detection of stress and emotion in speech using traditional and FFT based log energy features

  • Author

    Nwe, T.L. ; Foo, S.W. ; De Silva, L.C.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore
  • Volume
    3
  • fYear
    2003
  • fDate
    15-18 Dec. 2003
  • Firstpage
    1619
  • Abstract
    In this paper, a novel system for the detection of human stress and emotion in speech is proposed. The system makes use of FFT-based linear short-time log frequency power coefficients (LFPC) and TEO-based nonlinear LFPC features in both the time and frequency domains. The performance of the proposed system is compared with that of traditional approaches which use LPCC and MFCC features. Each approach is evaluated on the SUSAS (speech under simulated and actual stress) and ESMBS (emotional speech of Mandarin and Burmese speakers) databases. The proposed system is observed to outperform the traditional systems. Results show that the system using LFPC gives the highest accuracy (87.8% for stress classification, 89.2% for emotion classification), followed by the system using the NFD-LFPC feature, while the system using the NTD-LFPC feature gives the lowest accuracy.
  • Keywords
    emotion recognition; fast Fourier transforms; pattern classification; psychology; speech recognition; FFT based log energy features; emotion classification; emotional speech of Mandarin and Burmese speakers; fast Fourier transform; frequency domain; human emotion classification; human stress classification; linear acoustic features; log frequency power coefficients; nonlinear acoustic features; speech detection; speech under simulated and actual stress; time domain; Acoustic signal detection; Computer interfaces; Educational institutions; Hidden Markov models; Humans; Mel frequency cepstral coefficient; Power engineering and energy; Speech enhancement; Speech recognition; Stress;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information, Communications and Signal Processing, 2003 and Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint Conference of the Fourth International Conference on
  • Print_ISBN
    0-7803-8185-8
  • Type

    conf

  • DOI
    10.1109/ICICS.2003.1292741
  • Filename
    1292741