• DocumentCode
    3162220
  • Title

    Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions

  • Author

    Wöllmer, Martin ; Metallinou, Angeliki ; Katsamanis, Nassos ; Schuller, Björn ; Narayanan, Shrikanth

  • Author_Institution
    Inst. for Human-Machine Commun., Tech. Univ. Munchen, München, Germany
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4157
  • Lastpage
    4160
  • Abstract
    Recent studies indicate that bidirectional Long Short-Term Memory (BLSTM) recurrent neural networks are well-suited for automatic emotion recognition systems and may lead to better results than systems applying other widely used classifiers such as Support Vector Machines or feedforward Neural Networks. The good performance of BLSTM emotion recognition systems could be attributed to their ability to model and exploit contextual information self-learned via recurrently connected memory blocks which allows them to incorporate information about how emotion evolves over time. However, the actual amount of bidirectional context that a BLSTM classifier takes into account when classifying an observation has not been investigated so far. This paper presents a methodology to systematically investigate the number of past and future utterance-level observations that are considered to generate an emotion prediction for a given utterance, and to examine to what extent this temporal bidirectional context contributes to the overall BLSTM performance.
  • Keywords
    emotion recognition; recurrent neural nets; speech recognition; BLSTM neural network memory; automatic emotion recognition systems; bidirectional long short term memory; contextual information; dyadic spoken interaction; enhanced emotion classification; recurrent neural network; utterance level observation; Context; Context modeling; Emotion recognition; Feature extraction; Hidden Markov models; Recurrent neural networks; Long Short-Term Memory; context modeling; emotion recognition; sequential Jacobian;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288834
  • Filename
    6288834