Title :
Classifying emotions in human-machine spoken dialogs
Author :
Lee, Chul Min ; Narayanan, Shrikanth S. ; Pieraccini, Roberto
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fDate :
6/24/1905 12:00:00 AM
Abstract :
This paper reports on the comparison between various acoustic feature sets and classification algorithms for classifying spoken utterances based on the emotional state of the speaker. The data set used for the analysis comes from a corpus of human-machine dialogs obtained from a commercial application. Emotion recognition is posed as a pattern recognition problem. We used three different techniques - linear discriminant classifier (LDC), k-nearest neighborhood (k-NN) classifier, and support vector machine classifier (SVC) -for classifying utterances into 2 emotion classes: negative and non-negative. In this study, two feature sets were used; the base feature set obtained from the utterance-level statistics of the pitch and energy of the speech, and the feature set analyzed by principal component analysis (PCA). PCA showed a performance comparable to the base feature sets. Overall, the LDC achieved the best performance with error rates of 27.54% on female data and 25.46% on males with the base feature set. The SVC, however, showed a better performance in the problem of data sparsity.
Keywords :
acoustic signal processing; emotion recognition; feature extraction; learning automata; man-machine systems; principal component analysis; signal classification; PCA; acoustic feature sets; automatic negative emotions recognition; classification algorithms; data sparsity; emotions classification; error rates; feature extraction; female data; human-machine spoken dialogs; k-nearest neighborhood classifier; linear discriminant classifier; males data; nonnegative emotion; pattern recognition; principal component analysis; speech energy; speech pitch; speech signals; spoken utterances classification; support vector machine classifier; utterance-level statistics; Classification algorithms; Emotion recognition; Linear discriminant analysis; Loudspeakers; Man machine systems; Pattern recognition; Principal component analysis; Speech analysis; Static VAr compensators; Support vector machines;
Conference_Titel :
Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7803-7304-9
DOI :
10.1109/ICME.2002.1035887