DocumentCode :
2929815
Title :
Emotion recognition from speech VIA boosted Gaussian mixture models
Author :
Tang, Hao ; Chu, Stephen M. ; Hasegawa-Johnson, Mark ; Huang, Thomas S.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2009
fDate :
June 28 2009-July 3 2009
Firstpage :
294
Lastpage :
297
Abstract :
Gaussian mixture models (GMMs) and the minimum error rate classifier (i.e. Bayesian optimal classifier) are popular and effective tools for speech emotion recognition. Typically, GMMs are used to model the class-conditional distributions of acoustic features and their parameters are estimated by the expectation maximization (EM) algorithm based on a training data set. Then, classification is performed to minimize the classification error w.r.t. the estimated class-conditional distributions. We call this method the EM-GMM algorithm. In this paper, we introduce a Boosting algorithm for reliably and accurately estimating the class-conditional GMMs. The resulting algorithm is named the Boosted-GMM algorithm. Our speech emotion recognition experiments show that the emotion recognition rates are effectively and significantly "Boosted" by the Boosted-GMM algorithm as compared to the EM-GMM algorithm. This is due to the fact that the Boosting algorithm can lead to more accurate estimates of the class-conditional GMMs, namely the class-conditional distributions of acoustic features.
Keywords :
Bayes methods; Gaussian processes; acoustic signal processing; emotion recognition; expectation-maximisation algorithm; learning (artificial intelligence); signal classification; speech recognition; Bayesian optimal classifier; Boosted Gaussian mixture model; EM-GMM algorithm; acoustic feature; class-conditional distribution; expectation maximization algorithm; minimum error rate classifier; speech emotion recognition; training data set; Bayesian methods; Boosting; Data mining; Emotion recognition; Hidden Markov models; Pattern recognition; Signal processing algorithms; Speech enhancement; Speech recognition; Training data; Bayesian optimal classifier; EM algorithm; Emotion recognition; Gaussian mixture model; boosting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
ISSN :
1945-7871
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
Type :
conf
DOI :
10.1109/ICME.2009.5202493
Filename :
5202493
Link To Document :
بازگشت