Modeling gender information for emotion recognition using Denoising autoencoder

Author

Rui Xia ; Jun Deng ; Schuller, Bjorn ; Yang Liu

Author_Institution

Comput. Sci. Dept., Univ. of Texas at Dallas, Dallas, TX, USA

fYear

2014

fDate

4-9 May 2014

Firstpage

990

Lastpage

994

Abstract

The Denoising autoencoder (DAE) has been successfully applied to acoustic emotion recognition lately. In this paper, we adopt the framework of the modified DAE introduced in that projects the input signal to two different hidden representations, for neutral and emotional speech respectively, and uses the emotional representation for the classification task. We propose to model gender information for more robust emotional representation in this work. For neutral representation, male and female dependent DAEs are built using non-emotional speech with the aim of capturing distinct information between the two genders. The emotional hidden representation is shared for the two genders in order to model more emotion specific characteristics, and is used as features in a back-end classifier for emotion recognition. We propose different optimization objectives in training the DAEs. Our experimental results show improvement on unweighted accuracy compared with previous work using the modified DAE method and the classifiers using the standard static features. Further performance gain can be achieved by structural level system combination.

Keywords

emotion recognition; image classification; image coding; image denoising; image representation; speech coding; speech recognition; DAE; acoustic emotion recognition; back-end classifier; denoising autoencoder; emotional classification; emotional hidden representation; emotional speech representation; gender information modeling; neutral speech representation; optimization; signal representation; structural level system combination; Emotion recognition; Feature extraction; Noise reduction; Speech; Speech recognition; Training; Denoising autoencoder; Emotion recognition; Gender;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location

Florence

Type

conf

DOI

10.1109/ICASSP.2014.6853745

Filename

6853745