• DocumentCode
    1347425
  • Title

    Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition

  • Author

    Lin, Jen-Chun ; Wu, Chung-Hsien ; Wei, Wen-Li

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    14
  • Issue
    1
  • fYear
    2012
  • Firstpage
    142
  • Lastpage
    156
  • Abstract
    This paper presents an approach to the automatic recognition of human emotions from audio-visual bimodal signals using an error weighted semi-coupled hidden Markov model (EWSC-HMM). The proposed approach combines an SC-HMM with a state-based bimodal alignment strategy and a Bayesian classifier weighting scheme to obtain the optimal emotion recognition result based on audio-visual bimodal fusion. The state-based bimodal alignment strategy in SC-HMM is proposed to align the temporal relation between audio and visual streams. The Bayesian classifier weighting scheme is then adopted to explore the contributions of the SC-HMM-based classifiers for different audio-visual feature pairs in order to obtain the emotion recognition output. For performance evaluation, two databases are considered: the MHMC posed database and the SEMAINE naturalistic database. Experimental results show that the proposed approach not only outperforms other fusion-based bimodal emotion recognition methods for posed expressions but also provides satisfactory results for naturalistic expressions.
  • Keywords
    Bayes methods; audio streaming; audio-visual systems; emotion recognition; feature extraction; hidden Markov models; image classification; image fusion; video streaming; visual databases; Bayesian classifier weighting scheme; EWSC-HMM; SC-HMM based classifier; SEMAINE naturalistic database; audio stream; audio visual bimodal fusion; audio visual feature pair; automatic recognition; error weighted semicoupled hidden Markov model; optimal human emotion recognition; performance evaluation; state based bimodal alignment strategy; temporal relation; visual stream; Correlation; Databases; Emotion recognition; Hidden Markov models; Humans; Speech; Visualization; Audio-visual bimodal fusion; emotion recognition; semi-coupled hidden Markov model (SC-HMM);
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2011.2171334
  • Filename
    6042338