Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition

Author

Lin, Jen-Chun ; Wu, Chung-Hsien ; Wei, Wen-Li

Author_Institution

Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan

Volume

14

Issue

1

fYear

2012

Firstpage

142

Lastpage

156

Abstract

This paper presents an approach to the automatic recognition of human emotions from audio-visual bimodal signals using an error weighted semi-coupled hidden Markov model (EWSC-HMM). The proposed approach combines an SC-HMM with a state-based bimodal alignment strategy and a Bayesian classifier weighting scheme to obtain the optimal emotion recognition result based on audio-visual bimodal fusion. The state-based bimodal alignment strategy in SC-HMM is proposed to align the temporal relation between audio and visual streams. The Bayesian classifier weighting scheme is then adopted to explore the contributions of the SC-HMM-based classifiers for different audio-visual feature pairs in order to obtain the emotion recognition output. For performance evaluation, two databases are considered: the MHMC posed database and the SEMAINE naturalistic database. Experimental results show that the proposed approach not only outperforms other fusion-based bimodal emotion recognition methods for posed expressions but also provides satisfactory results for naturalistic expressions.

Keywords

Bayes methods; audio streaming; audio-visual systems; emotion recognition; feature extraction; hidden Markov models; image classification; image fusion; video streaming; visual databases; Bayesian classifier weighting scheme; EWSC-HMM; SC-HMM based classifier; SEMAINE naturalistic database; audio stream; audio visual bimodal fusion; audio visual feature pair; automatic recognition; error weighted semicoupled hidden Markov model; optimal human emotion recognition; performance evaluation; state based bimodal alignment strategy; temporal relation; visual stream; Correlation; Databases; Emotion recognition; Hidden Markov models; Humans; Speech; Visualization; Audio-visual bimodal fusion; emotion recognition; semi-coupled hidden Markov model (SC-HMM);

fLanguage

English

Journal_Title

Multimedia, IEEE Transactions on

Publisher

ieee

ISSN

1520-9210

Type

jour

DOI

10.1109/TMM.2011.2171334

Filename

6042338