DocumentCode :
153630
Title :
Automatic emotion variation detection using multi-scaled sliding window
Author :
Yuchao Fan ; Mingxing Xu ; Zhiyong Wu ; Lianhong Cai
Author_Institution :
Minist. of Educ. Tsinghua Nat. Lab. for Inf. Sci. & Technol. (TNList) Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
20-23 Sept. 2014
Firstpage :
232
Lastpage :
236
Abstract :
Emotion recognition from speech plays an important role in developing affective and intelligent Human Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotional salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel framework for AEVD using Multi-scaled Sliding Window (MSW-AEVD) to assign an emotion class to each window-shift by fusion decisions of all the sliding windows containing the shift. Firstly, sliding window with fixed-length is introduced as the basic procedure, in which several different fusion methods are investigated. Then multi-scaled sliding window is employed to support multi-classifiers with different timescale features, in which another two fusion strategies are provided. Finally, a postprocessing is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.
Keywords :
emotion recognition; human computer interaction; sensor fusion; signal classification; speech recognition; AEVD system; EMO-DB; MSW-AEVD; affective human computer interaction; angry-neutral speech; automatic emotion variation detection; continuous speech; emotion class assignment; emotion detection; emotion recognition; emotional salient segment; fixed-length sliding window; fusion decision; fusion method; fusion strategy; intelligent human computer interaction; multiclassifiers; multiscaled sliding window; performance evaluation; public Berlin database; window-shift; Accuracy; Emotion recognition; Feature extraction; Hidden Markov models; Speech; Speech recognition; Support vector machines; Emotion Detection; Emotion Variation; Multi-Scaled Sliding Window;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Orange Technologies (ICOT), 2014 IEEE International Conference on
Conference_Location :
Xian
Type :
conf
DOI :
10.1109/ICOT.2014.6956642
Filename :
6956642
Link To Document :
بازگشت