Title :
Speech Emotion Recognition using non-linear Teager energy based features in noisy environments
Author :
Georgogiannis, Alexandros ; Digalakis, Vassilis
Author_Institution :
Dept. of Electr. & Comput. Eng., Tech. Univ. of Crete, Chania, Greece
Abstract :
In this study, Teager-energy based Mel-frequency cepstral coefficients (TEMFCCs) are proposed for Automatic Speech Emotion Recognition (ASER) in noisy environments. TEMFCCs are obtained by taking the absolute value of the Teager-energy operator (TEO) of the short-time Fourier transform of the signal (STFT), warping it to a Mel-frequency scale, and taking the discrete cosine transform (DCT) of the log-Mel Teager-energy spectrum. Experiments on classification of discrete emotion categories show that TEMFCCs are more robust than MFCCs in noisy conditions, while TEMFCCs and MFCCs perform similarly for clean conditions.
Keywords :
Fourier transforms; cepstral analysis; discrete cosine transforms; emotion recognition; signal classification; speech recognition; ASER; DCT; STFT; TEMFCC; TEO; Teager energy based Mel frequency cepstral coefficient; Teager energy operator; automatic speech emotion recognition; discrete cosine transform; discrete emotion category classification; log-Mel Teager energy spectrum; noisy environment; nonlinear Teager energy; short time Fourier transform; 1f noise; Emotion recognition; Mel frequency cepstral coefficient; Noise measurement; Robustness; Speech; Speech recognition; emotion recognition; nonlinear acoustics; speech analysis;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location :
Bucharest
Print_ISBN :
978-1-4673-1068-0