DocumentCode :
2235881
Title :
Segment-based approach to the recognition of emotions in speech
Author :
Shami, Mohammad T. ; Kamel, Mohamed S.
Author_Institution :
Pattern Anal. & Machine Intelligence Lab, Waterloo Univ., Ont., Canada
fYear :
2005
fDate :
6-8 July 2005
Abstract :
A new framework for the context and speaker independent recognition of emotions from voice, based on a richer and more natural representation of the speech signal, is proposed. The utterance is viewed as consisting of a series of voiced segments and not as a single object. The voiced segments are first identified and then described using statistical measures of spectral shape, intensity, and pitch contours, calculated at both the segment and the utterance level. Utterance classification is performed by combining the segment classification decisions using a fixed combination scheme. The performance of two learning algorithms, support vector machines and K nearest neighbors, is compared. The proposed approach yields an overall classification accuracy of 87% for 5 emotions, outperforming previous results on a similar database.
Keywords :
emotion recognition; learning (artificial intelligence); speech processing; speech recognition; statistical analysis; support vector machines; emotion speech recognition; learning algorithm; speech signal; statistical measure; support vector machine; utterance classification; voice segmentation; Automatic speech recognition; Biology computing; Data mining; Databases; Emotion recognition; Feature extraction; Humans; Shape measurement; Spectral shape; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Print_ISBN :
0-7803-9331-7
Type :
conf
DOI :
10.1109/ICME.2005.1521436
Filename :
1521436
Link To Document :
بازگشت