DocumentCode
1855886
Title
Speech Emotion Recognition using non-linear Teager energy based features in noisy environments
Author
Georgogiannis, Alexandros ; Digalakis, Vassilis
Author_Institution
Dept. of Electr. & Comput. Eng., Tech. Univ. of Crete, Chania, Greece
fYear
2012
fDate
27-31 Aug. 2012
Firstpage
2045
Lastpage
2049
Abstract
In this study, Teager-energy based Mel-frequency cepstral coefficients (TEMFCCs) are proposed for Automatic Speech Emotion Recognition (ASER) in noisy environments. TEMFCCs are obtained by taking the absolute value of the Teager-energy operator (TEO) of the short-time Fourier transform of the signal (STFT), warping it to a Mel-frequency scale, and taking the discrete cosine transform (DCT) of the log-Mel Teager-energy spectrum. Experiments on classification of discrete emotion categories show that TEMFCCs are more robust than MFCCs in noisy conditions, while TEMFCCs and MFCCs perform similarly for clean conditions.
Keywords
Fourier transforms; cepstral analysis; discrete cosine transforms; emotion recognition; signal classification; speech recognition; ASER; DCT; STFT; TEMFCC; TEO; Teager energy based Mel frequency cepstral coefficient; Teager energy operator; automatic speech emotion recognition; discrete cosine transform; discrete emotion category classification; log-Mel Teager energy spectrum; noisy environment; nonlinear Teager energy; short time Fourier transform; 1f noise; Emotion recognition; Mel frequency cepstral coefficient; Noise measurement; Robustness; Speech; Speech recognition; emotion recognition; nonlinear acoustics; speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location
Bucharest
ISSN
2219-5491
Print_ISBN
978-1-4673-1068-0
Type
conf
Filename
6334230
Link To Document