DocumentCode :
2605221
Title :
Speaker-independent negative emotion recognition
Author :
Kotti, Margarita ; Paternò, Fabio ; Kotropoulos, Constantine
Author_Institution :
ISTI, CNR, Pisa, Italy
fYear :
2010
fDate :
14-16 June 2010
Firstpage :
417
Lastpage :
422
Abstract :
This work aims to provide a method able to distinguish between negative and non-negative emotions in vocal interaction. A large pool of 1418 features is extracted for that purpose. Several of those features are tested in emotion recognition for the first time. Next, feature selection is applied separately to male and female utterances. In particular, a bidirectional Best First search with backtracking is applied. The first contribution is the demonstration that a significant number of features, first tested here, are retained after feature selection. The selected features are then fed as input to support vector machines with various kernel functions as well as to the K nearest neighbors classifier. The second contribution is in the speaker-independent experiments conducted in order to cope with the limited number of speakers present in the commonly used emotion speech corpora. Speaker-independent systems are known to be more robust and present a better generalization ability than the speaker-dependent ones. Experimental results are reported for the Berlin emotional speech database. The best performing classifier is found to be the support vector machine with the Gaussian radial basis function kernel. Correctly classified utterances are 86.73%±3.95% for male subjects and 91.73%±4.18% for female subjects. The last contribution is in the statistical analysis of the performance of the support vector machine classifier against the K nearest neighbors classifier as well as the statistical analysis of the various support vector machine kernels impact.
Keywords :
audio databases; radial basis function networks; search problems; speaker recognition; support vector machines; Gaussian radial basis function kernel; K nearest neighbors classifier; bidirectional best first search; emotion speech corpora; emotional speech database; feature extraction; feature selection; speaker-independent negative emotion recognition; support vector machines; vocal interaction; Databases; Emotion recognition; Feature extraction; Kernel; Mel frequency cepstral coefficient; Speech; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cognitive Information Processing (CIP), 2010 2nd International Workshop on
Conference_Location :
Elba
Print_ISBN :
978-1-4244-6457-9
Type :
conf
DOI :
10.1109/CIP.2010.5604091
Filename :
5604091
Link To Document :
بازگشت