DocumentCode
2331340
Title
Recognizing emotion in speech
Author
Dellaert, Frank ; Polzin, Thomas ; Waibel, Alex
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
3
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1970
Abstract
The paper explores several statistical pattern recognition techniques to classify utterances according to their emotional content. The authors have recorded a corpus containing emotional speech with over a 1000 utterances from different speakers. They present a new method of extracting prosodic features from speech, based on a smoothing spline approximation of the pitch contour. To make maximal use of the limited amount of training data available, they introduce a novel pattern recognition technique: majority voting of subspace specialists. Using this technique, they obtain classification performance that is close to human performance on the task
Keywords
human factors; pattern classification; psychology; smoothing methods; speech recognition; splines (mathematics); statistical analysis; classification performance; emotion recognition; emotional content; emotional speech corpus; majority voting; pitch contour; prosodic feature extraction; smoothing spline approximation; speech; statistical pattern recognition techniques; subspace specialists; training data; utterance classification; Data mining; Emotion recognition; Feature extraction; Humans; Pattern recognition; Smoothing methods; Speech recognition; Spline; Training data; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.608022
Filename
608022
Link To Document