• DocumentCode
    592054
  • Title

    Multimodal Information Fusion of Audio Emotion Recognition Based on Kernel Entropy Component Analysis

  • Author

    Zhibing Xie ; Ling Guan

  • Author_Institution
    Ryerson Multimedia Res. Lab., Ryerson Univ., Toronto, ON, Canada
  • fYear
    2012
  • fDate
    10-12 Dec. 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper focuses on the application of novel information theoretic tools in the area of information fusion. Feature transformation and fusion is critical for the performance of information fusion, however the majority of the existing works depend on the second order statistics, which is only optimal for Gaussian-like distribution. In this paper, the integration of information fusion techniques and kernel entropy component analysis provides a new information theoretic tool. The fusion of features is realized using descriptor of information entropy and optimized by entropy estimation. A novel multimodal information fusion strategy of audio emotion recognition based on kernel entropy component analysis (KECA) has been presented. The effectiveness of the proposed solution is evaluated though experimentation on two audiovisual emotion databases. Experimental results show that the proposed solution outperforms the existing methods, especially when the dimension of feature space is substantially reduced. The proposed method offers general theoretical analysis which gives us an approach to implement information theory into multimedia research.
  • Keywords
    Gaussian distribution; audio signal processing; audio-visual systems; emotion recognition; entropy; feature extraction; principal component analysis; sensor fusion; statistical analysis; Gaussian distribution; KECA; audio emotion recognition; audiovisual emotion database; entropy estimation; feature space; feature transformation; information entropy; information theory tool; kernel entropy component analysis; multimodal information fusion; optimization; second order statistics; Eigenvalues and eigenfunctions; Emotion recognition; Entropy; Estimation; Feature extraction; Kernel; Speech; Emotion recognition; Kernel entropy component analysis; Multimodal information fusion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia (ISM), 2012 IEEE International Symposium on
  • Conference_Location
    Irvine, CA
  • Print_ISBN
    978-1-4673-4370-1
  • Type

    conf

  • DOI
    10.1109/ISM.2012.9
  • Filename
    6424622