DocumentCode :
1798682
Title :
Speech emotion recognition based on dynamic models
Author :
Guoyun Lv ; Shuixian Hu ; Xipan Lu
Author_Institution :
Sch. of Electron. & Inf., Northwestern Polytech. Univ., Xi´an, China
fYear :
2014
fDate :
7-9 July 2014
Firstpage :
480
Lastpage :
484
Abstract :
This paper introduced the semi-continuous Hidden Markov Model (HMM) and proposed a novel Dynamic Bayesian Network (DBN) model for dynamic speech emotion recognition. The former reduces the training complexity caused by mixture Gaussians by sharing the Condition Probability Densities (CPDs) of Gaussians among the states, and the latter adds a sub-state layer between state and observation layer based on traditional DBN framework and describes the dynamic process of speech emotion in detail. Experiments results show that average emotion recognition rate of semi-continuous HMM is 4% and 10% higher than those of classical HMM and Mixture Gaussian HMM respectively, and average emotion recognition rate of the three-layer DBN model is 11% and 8% higher than those of traditional DBN model and semi-continuous HMM.
Keywords :
Bayes methods; Gaussian processes; emotion recognition; hidden Markov models; speech recognition; Gaussian CPDs; Gaussian condition probability densities; dynamic Bayesian network model; dynamic speech emotion recognition; semicontinuous HMM; semicontinuous hidden Markov model; three-layer DBN model; Emotion recognition; Feature extraction; Hidden Markov models; Speech; Speech processing; Speech recognition; Support vector machines; dynamic bayesian network; dynamic model; emotion recognition; hidden markov model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2014 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4799-3902-2
Type :
conf
DOI :
10.1109/ICALIP.2014.7009840
Filename :
7009840
Link To Document :
بازگشت