DocumentCode
3317072
Title
Emotions in speech - experiments with prosody and quality features in speech for use in categorical and dimensional emotion recognition environments
Author
Borchert, Martin ; Dusterhoft, A.
Author_Institution
Dept. of Electr. Eng., Univ. of Bus., Technol. & Design, Wismar, Germany
fYear
2005
fDate
30 Oct.-1 Nov. 2005
Firstpage
147
Lastpage
151
Abstract
This paper focuses on features in speech and classification algorithms for using them in emotion recognition software. We illustrate a new approach concentrated on analyzing speech quality features. The quality features are formants, spectral energy distribution in different frequency bands, harmonics-to-noise ratio (in different frequency bands) and irregularities (jitter, shimmer). Some papers (A. Dusterhoft et al., 2003, D. Goleman, 1995) show that there is a relationship between quality features and the valence axis. This paper deals with dimensional approach to classify emotions. Therefore, mainly quality features are taken for the valence axis to classify emotions and mainly prosody features are taken for the arousal axis. Because our experiments show that single emotion recognition rates are up to 90 percent and the recognition rates for speaker independent recognition is about 70% for all classification algorithm, it seems that quality features are more appropriate to differentiate emotions with the same arousal and different valence levels in a dimensional approach. A prototypical emotion recognition software is implemented which is actually tested for analyzing the mood of customers in call centers.
Keywords
call centres; emotion recognition; signal classification; speech recognition; call centers; categorical emotion recognition; dimensional emotion recognition software; emotion classification algorithms; harmonics-to-noise ratio; prosody features; speaker independent recognition; spectral energy distribution; speech quality features; Algorithm design and analysis; Classification algorithms; Emotion recognition; Frequency; Java; Mood; Software algorithms; Software prototyping; Software quality; Speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN
0-7803-9361-9
Type
conf
DOI
10.1109/NLPKE.2005.1598724
Filename
1598724
Link To Document