DocumentCode
672355
Title
Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks
Author
Le, Dat ; Provost, Emily Mower
Author_Institution
Comput. Sci. & Eng., Univ. of Michigan, Ann Arbor, MI, USA
fYear
2013
fDate
8-12 Dec. 2013
Firstpage
216
Lastpage
221
Abstract
Research in emotion recognition seeks to develop insights into the temporal properties of emotion. However, automatic emotion recognition from spontaneous speech is challenging due to non-ideal recording conditions and highly ambiguous ground truth labels. Further, emotion recognition systems typically work with noisy high-dimensional data, rendering it difficult to find representative features and train an effective classifier. We tackle this problem by using Deep Belief Networks, which can model complex and non-linear high-level relationships between low-level features. We propose and evaluate a suite of hybrid classifiers based on Hidden Markov Models and Deep Belief Networks. We achieve state-of-the-art results on FAU Aibo, a benchmark dataset in emotion recognition [1]. Our work provides insights into important similarities and differences between speech and emotion.
Keywords
belief networks; emotion recognition; hidden Markov models; speech recognition; deep belief networks; emotion recognition systems; hidden Markov models; hybrid classifiers; low-level features; noisy high-dimensional data; non ideal recording conditions; spontaneous speech; temporal properties; Computer architecture; Context; Emotion recognition; Hidden Markov models; Speech; Speech recognition; Training; FAU Aibo; deep belief networks; dynamic modeling; emotion classification; spontaneous speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location
Olomouc
Type
conf
DOI
10.1109/ASRU.2013.6707732
Filename
6707732
Link To Document