DocumentCode :
3132443
Title :
Comparison of adaptation methods for GMM-SVM based speech emotion recognition
Author :
Jianbo Jiang ; Zhiyong Wu ; Mingxing Xu ; Jia Jia ; Lianhong Cai
Author_Institution :
Tsinghua-CUHK Joint Res. Center for Media Sci., Technol. & Syst., Tsinghua Univ., Shenzhen, China
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
269
Lastpage :
273
Abstract :
The required length of the utterance is one of the key factors affecting the performance of automatic emotion recognition. To gain the accuracy rate of emotion distinction, adaptation algorithms that can be manipulated on short utterances are highly essential. Regarding this, this paper compares two classical model adaptation methods, maximum a posteriori (MAP) and maximum likelihood linear regression (MLLR), in GMM-SVM based emotion recognition, and tries to find which method can perform better on different length of the enrollment of the utterances. Experiment results show that MLLR adaptation performs better for very short enrollment utterances (with the length shorter than 2s) while MAP adaptation is more effective for longer utterances.
Keywords :
Gaussian processes; emotion recognition; maximum likelihood estimation; regression analysis; speech recognition; support vector machines; GMM-SVM based speech emotion recognition; MAP; MLLR; automatic emotion recognition; emotion distinction; maximum a posteriori; maximum likelihood linear regression; model adaptation methods; short utterances; Adaptation models; Databases; Emotion recognition; Hidden Markov models; Speech; Speech recognition; Support vector machines; GMM supervector based SVM; MAP adaptation; MLLR adaptation; emotion recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424234
Filename :
6424234
Link To Document :
بازگشت