DocumentCode
177773
Title
Modeling gender information for emotion recognition using Denoising autoencoder
Author
Rui Xia ; Jun Deng ; Schuller, Bjorn ; Yang Liu
Author_Institution
Comput. Sci. Dept., Univ. of Texas at Dallas, Dallas, TX, USA
fYear
2014
fDate
4-9 May 2014
Firstpage
990
Lastpage
994
Abstract
The Denoising autoencoder (DAE) has been successfully applied to acoustic emotion recognition lately. In this paper, we adopt the framework of the modified DAE introduced in that projects the input signal to two different hidden representations, for neutral and emotional speech respectively, and uses the emotional representation for the classification task. We propose to model gender information for more robust emotional representation in this work. For neutral representation, male and female dependent DAEs are built using non-emotional speech with the aim of capturing distinct information between the two genders. The emotional hidden representation is shared for the two genders in order to model more emotion specific characteristics, and is used as features in a back-end classifier for emotion recognition. We propose different optimization objectives in training the DAEs. Our experimental results show improvement on unweighted accuracy compared with previous work using the modified DAE method and the classifiers using the standard static features. Further performance gain can be achieved by structural level system combination.
Keywords
emotion recognition; image classification; image coding; image denoising; image representation; speech coding; speech recognition; DAE; acoustic emotion recognition; back-end classifier; denoising autoencoder; emotional classification; emotional hidden representation; emotional speech representation; gender information modeling; neutral speech representation; optimization; signal representation; structural level system combination; Emotion recognition; Feature extraction; Noise reduction; Speech; Speech recognition; Training; Denoising autoencoder; Emotion recognition; Gender;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6853745
Filename
6853745
Link To Document