• DocumentCode
    177773
  • Title

    Modeling gender information for emotion recognition using Denoising autoencoder

  • Author

    Rui Xia ; Jun Deng ; Schuller, Bjorn ; Yang Liu

  • Author_Institution
    Comput. Sci. Dept., Univ. of Texas at Dallas, Dallas, TX, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    990
  • Lastpage
    994
  • Abstract
    The Denoising autoencoder (DAE) has been successfully applied to acoustic emotion recognition lately. In this paper, we adopt the framework of the modified DAE introduced in that projects the input signal to two different hidden representations, for neutral and emotional speech respectively, and uses the emotional representation for the classification task. We propose to model gender information for more robust emotional representation in this work. For neutral representation, male and female dependent DAEs are built using non-emotional speech with the aim of capturing distinct information between the two genders. The emotional hidden representation is shared for the two genders in order to model more emotion specific characteristics, and is used as features in a back-end classifier for emotion recognition. We propose different optimization objectives in training the DAEs. Our experimental results show improvement on unweighted accuracy compared with previous work using the modified DAE method and the classifiers using the standard static features. Further performance gain can be achieved by structural level system combination.
  • Keywords
    emotion recognition; image classification; image coding; image denoising; image representation; speech coding; speech recognition; DAE; acoustic emotion recognition; back-end classifier; denoising autoencoder; emotional classification; emotional hidden representation; emotional speech representation; gender information modeling; neutral speech representation; optimization; signal representation; structural level system combination; Emotion recognition; Feature extraction; Noise reduction; Speech; Speech recognition; Training; Denoising autoencoder; Emotion recognition; Gender;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853745
  • Filename
    6853745