• DocumentCode
    1855886
  • Title

    Speech Emotion Recognition using non-linear Teager energy based features in noisy environments

  • Author

    Georgogiannis, Alexandros ; Digalakis, Vassilis

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Tech. Univ. of Crete, Chania, Greece
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    2045
  • Lastpage
    2049
  • Abstract
    In this study, Teager-energy based Mel-frequency cepstral coefficients (TEMFCCs) are proposed for Automatic Speech Emotion Recognition (ASER) in noisy environments. TEMFCCs are obtained by taking the absolute value of the Teager-energy operator (TEO) of the short-time Fourier transform of the signal (STFT), warping it to a Mel-frequency scale, and taking the discrete cosine transform (DCT) of the log-Mel Teager-energy spectrum. Experiments on classification of discrete emotion categories show that TEMFCCs are more robust than MFCCs in noisy conditions, while TEMFCCs and MFCCs perform similarly for clean conditions.
  • Keywords
    Fourier transforms; cepstral analysis; discrete cosine transforms; emotion recognition; signal classification; speech recognition; ASER; DCT; STFT; TEMFCC; TEO; Teager energy based Mel frequency cepstral coefficient; Teager energy operator; automatic speech emotion recognition; discrete cosine transform; discrete emotion category classification; log-Mel Teager energy spectrum; noisy environment; nonlinear Teager energy; short time Fourier transform; 1f noise; Emotion recognition; Mel frequency cepstral coefficient; Noise measurement; Robustness; Speech; Speech recognition; emotion recognition; nonlinear acoustics; speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
  • Conference_Location
    Bucharest
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type

    conf

  • Filename
    6334230