Title :
Speech Emotion Classification Using Machine Learning Algorithms
Author :
Casale, S. ; Russo, A. ; Scebba, G. ; Serrano, S.
Author_Institution :
Dipt. di Ing. Inf. e delle Telecomun. Fac. di Ing., Univ. di Catania, Catania
Abstract :
The recognition of emotional states is a relatively new technique in the field of machine learning. The paper presents the study and the performance results of a system for emotion classification using the architecture of a distributed speech recognition system (DSR). The features used were extracted by the front-end ETSI Aurora eXtended of a mobile terminal in compliance with the ETSI ES 202-211 V1.1.1 standard. On the basis of the time trend of these parameters, over 3800 statistical parameters were extracted to characterize semantic units of varying length (sentences and words). Using the WEKA (Waikato Environment for Knowledge Analysis) software the most significant parameters for the classification of emotional states were selected and the results of various classification techniques were analysed. The results, obtained using both the Berlin Database of Emotional Speech (EMO-DB) and the Speech Under Simulated and Actual Stress (SUSAS) corpus, showed that the best performance is achieved using a support vector machine (SVM) trained with the sequential minimal optimization (SMO) algorithm, after normalizing and discretizing the input statistical parameters.
Keywords :
learning (artificial intelligence); optimisation; pattern classification; speech recognition; support vector machines; Berlin Database of Emotional Speech; SVM; Speech Under Simulated and Actual Stress; WEKA; Waikato Environment for Knowledge Analysis; distributed speech recognition system; machine learning algorithm; sentences; sequential minimal optimization algorithm; speech emotion classification; support vector machine; words; Databases; Emotion recognition; Feature extraction; Machine learning; Machine learning algorithms; Speech recognition; Stress; Support vector machine classification; Support vector machines; Telecommunication standards; ETSI ES 202-211 standard; emo-db; emotion classification; machine learning; speech analysis; susas;
Conference_Titel :
Semantic Computing, 2008 IEEE International Conference on
Conference_Location :
Santa Clara, CA
Print_ISBN :
978-0-7695-3279-0
Electronic_ISBN :
978-0-7695-3279-0
DOI :
10.1109/ICSC.2008.43