مرکز منطقه ای اطلاع رساني علوم و فناوري - A multi-modal approach to emotion recognition using undirected topic models

DocumentCode :

1768534

Title :

A multi-modal approach to emotion recognition using undirected topic models

Author :

Shah, Mubarak ; Chakrabarti, Chaitali ; Spanias, A.

Author_Institution :

Sch. of Electr., Arizona State Univ., Tempe, AZ, USA

fYear :

2014

fDate :

1-5 June 2014

Firstpage :

754

Lastpage :

757

Abstract :

A multi-modal framework for emotion recognition using bag-of-words features and undirected, replicated softmax topic models is proposed here. Topic models ignore the temporal information between features, allowing them to capture the complex structure without a brute-force collection of statistics. Experiments are performed over face, speech and language features extracted from the USC IEMOCAP database. Performance on facial features yields an unweighted average recall of 60.71%, a relative improvement of 8.89% over state-of-the-art approaches. A comparable performance is achieved when considering only speech (57.39%) or a fusion of speech and face information (66.05%). Individually, each source is shown to be strong at recognizing either sadness (speech) or happiness (face) or neutral (language) emotions, while, a multi-modal fusion retains these properties and improves the accuracy to 68.92%. Implementation time for each source and their combination is provided. Results show that a turn of 1 second duration can be classified in approximately 666.65ms, thus making this method highly amenable for real-time implementation.

Keywords :

emotion recognition; face recognition; feature extraction; natural language processing; sensor fusion; speech recognition; USC IEMOCAP database; bag-of-words features; brute-force collection; complex structure; emotion recognition; face feature extraction; facial feature; happiness recognition; language feature extraction; multimodal approach; multimodal fusion; neutral emotions; sadness recognition; speech feature extraction; speech-face information fusion; statistics; temporal information; undirected replicated softmax topic models; undirected topic models; unweighted average recall; Databases; Emotion recognition; Face; Feature extraction; Hidden Markov models; Speech; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Circuits and Systems (ISCAS), 2014 IEEE International Symposium on

Conference_Location :

Melbourne VIC

Print_ISBN :

978-1-4799-3431-7

Type :

conf

DOI :

10.1109/ISCAS.2014.6865245

Filename :

6865245

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1768534