مرکز منطقه ای اطلاع رساني علوم و فناوري - Audio–Visual Affective Expression Recognition Through Multistream Fused HMM

DocumentCode :

1221919

Title :

Audio–Visual Affective Expression Recognition Through Multistream Fused HMM

Author :

Zeng, Zhihong ; Tu, Jilin ; Pianfetti, Brian M. ; Huang, Thomas S.

Author_Institution :

Beckman Inst., Univ. of Illinois at Urbana-Champaign (UIUC), Urbana, IL

Volume :

Issue :

fYear :

2008

fDate :

6/1/2008 12:00:00 AM

Firstpage :

570

Lastpage :

577

Abstract :

Advances in computer processing power and emerging algorithms are allowing new ways of envisioning human-computer interaction. Although the benefit of audio-visual fusion is expected for affect recognition from the psychological and engineering perspectives, most of existing approaches to automatic human affect analysis are unimodal: information processed by computer system is limited to either face images or the speech signals. This paper focuses on the development of a computing algorithm that uses both audio and visual sensors to detect and track a user´s affective state to aid computer decision making. Using our multistream fused hidden Markov model (MFHMM), we analyzed coupled audio and visual streams to detect four cognitive states (interest, boredom, frustration and puzzlement) and seven prototypical emotions (neural, happiness, sadness, anger, disgust, fear and surprise). The MFHMM allows the building of an optimal connection among multiple streams according to the maximum entropy principle and the maximum mutual information criterion. Person-independent experimental results from 20 subjects in 660 sequences show that the MFHMM approach outperforms face-only HMM, pitch-only HMM, energy-only HMM, and independent HMM fusion, under clean and varying audio channel noise condition.

Keywords :

emotion recognition; face recognition; hidden Markov models; human computer interaction; speech recognition; audio-visual affective expression recognition; automatic human affect analysis; computer decision making; hidden Markov model; human-computer interaction; maximum entropy principle; Automatic speech recognition; Face recognition; Hidden Markov models; Humans; Image recognition; Power engineering and energy; Power engineering computing; Psychology; Speech analysis; Streaming media; Affective computing; affect recognition; emotion recognition; human computing; human–computer interaction; multimodal fusion;

fLanguage :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

ieee

ISSN :

1520-9210

Type :

jour

DOI :

10.1109/TMM.2008.921737

Filename :

4523967

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1221919