DocumentCode :
3185204
Title :
Benchmarking classification models for emotion recognition in natural speech: A multi-corporal study
Author :
Tarasov, Alexey ; Delany, Sarah Jane
Author_Institution :
Digital Media Centre, Dublin Inst. of Technol., Dublin, Ireland
fYear :
2011
fDate :
21-25 March 2011
Firstpage :
841
Lastpage :
846
Abstract :
A significant amount of the research on automatic emotion recognition from speech focuses on acted speech that is produced by professional actors. This approach often leads to overoptimistic results as the recognition of emotion in real-life conditions is more challenging due the propensity of mixed and less intense emotions in natural speech. The paper presents an empirical study of the most widely used classifiers in the domain of emotion recognition from speech, across multiple non-acted emotional speech corpora. The results indicate that Support Vector Machines have the best performance and that they along with Multi-Layer Perceptron networks and k-nearest neighbour classifiers perform significantly better (using the appropriate statistical tests) than decision trees, Naïve Bayes classifiers and Radial Basis Function networks.
Keywords :
emotion recognition; multilayer perceptrons; pattern classification; support vector machines; automatic emotion recognition; benchmarking classification models; k-nearest neighbour classifiers; multilayer perceptron networks; natural speech; nonacted emotional speech corpora; support vector machines; Decision trees; Emotion recognition; Kernel; Niobium; Speech; Speech recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Face & Gesture Recognition and Workshops (FG 2011), 2011 IEEE International Conference on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
978-1-4244-9140-7
Type :
conf
DOI :
10.1109/FG.2011.5771359
Filename :
5771359
Link To Document :
بازگشت