DocumentCode :
661390
Title :
Feature space dimension reduction in speech emotion recognition using support vector machine
Author :
Bo-Chang Chiou ; Chia-Ping Chen
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Sun Yat-sen Univ., Kaohsiung, Taiwan
fYear :
2013
fDate :
Oct. 29 2013-Nov. 1 2013
Firstpage :
1
Lastpage :
6
Abstract :
We report implementations of automatic speech emotion recognition systems based on support vector machines in this paper. While common systems often extract a very large feature set per utterance for emotion classification, we conjecture that the dimension of the feature space can be greatly reduced without severe degradation of accuracy. Consequently, we systematically reduce the number of features via feature selection and principal component analysis. The evaluation is carried out on the Berlin Database of Emotional Speech, also known as EMO-DB, which consists of 10 speakers and 7 emotions. The results show that we can trim the feature set to 37 features and still maintain an accuracy of 80%. This means a reduction of more than 99% compared to the baseline system which uses more than 6,000 features.
Keywords :
emotion recognition; feature extraction; feature selection; principal component analysis; speech recognition; support vector machines; Berlin Database-of-Emotional Speech; EMO-DB; automatic speech emotion recognition systems; emotion classification; feature extraction; feature selection; feature space dimension reduction; principal component analysis; support vector machine; utterance; Accuracy; Databases; Mel frequency cepstral coefficient; Principal component analysis; Speech; Speech recognition; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
Type :
conf
DOI :
10.1109/APSIPA.2013.6694251
Filename :
6694251
Link To Document :
بازگشت