• DocumentCode
    118342
  • Title

    Multimodal continuous affect recognition based on LSTM and multiple kernel learning

  • Author

    Jiamei Wei ; Ercheng Pei ; Dongmei Jiang ; Sahli, Hichem ; Lei Xie ; Zhonghua Fu

  • Author_Institution
    VUB-NPU Joint AVSP Res. Lab., Northwestern Polytech. Univ., Xian, China
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we propose a Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) and multiple kernel learning (MKL) based multi-modal affect recognition scheme (LSTM-MKL). It takes the LSTM-RNN advantage to model the long range dependencies between successive observations, and uses the MKL power to model the non-linear correlations between the inputs and outputs. For each of the affect dimensions (arousal, valence, expectancy, and power), two LSTM-RNN models are trained, one for each modality. In the recognition phase, the audio and visual features are input to the corresponding learned LSTM models, which in turn produce initial estimates of the affect dimensions. The LSTM outputs are further input into a multi-kernel support vector regression (MK-SVR) for the final recognition. Experimental results carried out on the AVEC2012 database, show that compared to the traditional SVR-LLR (Support Vector Machine - local linear regression) or MK-SVR fusion scheme, the proposed LSTM-MKL fusion scheme obtains higher recognition results, with an correlation coefficient (COR) of 0.354, compared to a COR of 0.124 for SVR-LLR, and 0.168 for MK-SVR, respectively.
  • Keywords
    correlation methods; emotion recognition; feature extraction; image recognition; neural nets; regression analysis; speech recognition; support vector machines; LSTM-MKL fusion; audio recognition; feature recognition; long short-term memory recurrent neural network; multikernel support vector regression; multimodal affect recognition scheme; multimodal continuous affect recognition; multiple kernel learning; nonlinear correlation; visual recognition; Abstracts; Correlation; Decision support systems; Kernel; Recurrent neural networks; Support vector machines; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
  • Conference_Location
    Siem Reap
  • Type

    conf

  • DOI
    10.1109/APSIPA.2014.7041743
  • Filename
    7041743