Title :
Excitation source and low level descriptor features fusion for emotion recognition using SVM and ANN
Author :
Al-Talabani, Abdulbasit ; Sellahewa, Harin ; Jassim, S.
Author_Institution :
Appl. Comput. Dept., Univ. of Buckingham, Buckingham, UK
Abstract :
Emotion recognition is a challenging task with many applications in healthcare and human-machine interaction. In this study we propose to fuse two sets of features for emotion recognition at the classification level. A set of features that includes LPCC and MFCC extracted from LP-residual samples and Wavelet Octave Coefficient Of Residual (WOCOR) is proposed in this study as excitation source features. The proposed set of features is fused with 6552 LLDs using SVM and ANN classifiers. The experiments are tested on a newly acquired emotional speech database in Kurdish language, the Berlin emotional speech database, and the Aibo database. The experiments demonstrate that the fusion of the proposed excitation source features with the common LLDs can achieve better recognition accuracies than what is reported in the state-of-the-art studies.
Keywords :
cepstral analysis; emotion recognition; feature extraction; natural language processing; neural nets; pattern classification; speech processing; support vector machines; ANN classifier; Aibo database; Berlin emotional speech database; Kurdish language; LLD; LP-residual samples; LPCC; MFCC; SVM classifier; WOCOR; classification level; descriptor features fusion; emotion recognition; excitation source features; healthcare; human-machine interaction; recognition accuracy; wavelet octave coefficient of residual; Accuracy; Artificial neural networks; Databases; Emotion recognition; Feature extraction; Speech; Support vector machines; Emotion; Feature extraction; LP-Residuai; Pattern Recognition;
Conference_Titel :
Computer Science and Electronic Engineering Conference (CEEC), 2013 5th
Conference_Location :
Colchester
DOI :
10.1109/CEEC.2013.6659464