DocumentCode
2541516
Title
Speech Emotion Recognition Research Based on the Stacked Generalization Ensemble Neural Network for Robot Pet
Author
Huang, Yongming ; Zhang, Guobao ; Xu, Xiaoli
Author_Institution
Sch. of Autom., Southeast Univ., Nanjing, China
fYear
2009
fDate
4-6 Nov. 2009
Firstpage
1
Lastpage
5
Abstract
In this paper, we present an emotion recognition system using the stacked generalization ensemble neural network for special human affective state in the speech signal. 450 short emotional sentences with different contents from 3 speakers were collected as experiment materials. The features relevant with energy, speech rate, pitch and formant are extracted from speech signals. Stacked generalization ensemble neural networks are used as the classifier for 5 emotions including anger, calmness, happiness, sadness and boredom. First, compared with the traditional BP network or wavelet neural network, the results of experiments show that the stacked generalization ensemble neural network has faster convergence speed and higher recognition rate. Second, after discussing the advantage and disadvantage between different ensemble neural networks, suitable decision will be made for robot pet.
Keywords
emotion recognition; feature extraction; intelligent robots; neurocontrollers; signal classification; speech recognition; BP network; convergence speed; intelligent robot pet; special human affective state; speech emotion recognition system; speech signal extraction; stacked generalization ensemble neural network classifier; wavelet neural network; Emotion recognition; Filtering; Frequency; Humans; Linear predictive coding; Neural networks; Positron emission tomography; Robotics and automation; Robots; Speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4244-4199-0
Type
conf
DOI
10.1109/CCPR.2009.5344020
Filename
5344020
Link To Document