DocumentCode :
681693
Title :
Detection of affective states from speech signals using ensembles of classifiers
Author :
Kobayashi, V.B. ; Calag, V.B.
Author_Institution :
Univ. of the Philippines Mindanao, Davao, Philippines
fYear :
2013
fDate :
2-3 Dec. 2013
Firstpage :
1
Lastpage :
9
Abstract :
Recently, the focus of investigation has been the design of classifier systems that achieve optimal classification accuracy in detecting affective states from speech signals. Previous works have shown the inadequacy of single classifier models to deal with this problem. In this work we propose to use ensemble learning techniques such as random forest and kernel factory as classifiers in the design of a speech emotion recognition system. The system proceeds from speech signal pre-processing, feature extraction, construction of classifiers, and finally to the prediction of emotion. Features related to fundamental frequency, energy, mel-frequency cepstrum coefficients and linear predictive cepstrum coefficients were extracted from the individual segments. Subsequently, we trained two ensembles classifiers, namely, random forest and kernel factory. We have tested our approach on a number of speech databases. The results showed that ensemble classifiers yielded superior classification performance compared to single models by at most 20% increase. We also found out that our results exceeded the results of existing studies. We concluded that ensemble classifiers are effective for the identification of emotions, hence, suitable for this domain.
Keywords :
emotion recognition; feature extraction; learning (artificial intelligence); pattern classification; speech processing; speech recognition; affective states; classifier systems; classifiers construction; emotion prediction; ensemble learning techniques; feature extraction; fundamental frequency; kernel factory; linear predictive cepstrum coefficients; mel-frequency cepstrum coefficients; random forest; speech emotion recognition system; speech signal preprocessing; speech signals; Emotional Speech Recognition; Ensemble Learning; Kernel Factory; Random Forest;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Intelligent Signal Processing Conference 2013 (ISP 2013), IET
Conference_Location :
London
Electronic_ISBN :
978-1-84919-774-8
Type :
conf
DOI :
10.1049/cp.2013.2067
Filename :
6740516
Link To Document :
بازگشت