DocumentCode
3049076
Title
A Hybrid Speech Emotion Perception Method of VQ-based Feature Processing and ANN Recognition
Author
Wenjing, Han ; Haifeng, Li ; Chunyu, Guo
Author_Institution
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Volume
2
fYear
2009
fDate
19-21 May 2009
Firstpage
145
Lastpage
149
Abstract
This paper constructs a VQ/ANN (vector quantization/artificial neural network) based speech emotion recognition system. The system first extracts the basic prosodic parameters and Mel-frequency cepstral coefficients (MFCC) frame by frame. Recent researches reveal that MFCC convey detailed emotional relevant information of syllable. However, the statistic measures of MFCC confuse the information at sentence level. Therefore, this paper proposes a VQ-based method different to statistic method to generate measures of MFCC. Then the combination of VQ-based MFCC measures and the statistic measures of prosodic parameters is used as input feature vector. The ANN is performed to process the combination features and the statistic measures of all extracted parameters respectively. The experiment results reveal that the combination features outperform the statistic measures. More detailed analysis indicates that the combination features could characterize the emotion space better than the statistic features. Besides, the rationality of VQ/ANN based framework is also demonstrated.
Keywords
cepstral analysis; emotion recognition; neural nets; speech processing; vector quantisation; ANN recognition; VQ-based feature processing; artificial neural network; mel-frequency cepstral coefficients; speech emotion perception method; vector quantization; Artificial neural networks; Cepstral analysis; Data mining; Emotion recognition; Mel frequency cepstral coefficient; Performance evaluation; Speech processing; Speech recognition; Statistics; Vector quantization; artificial neural network; emtoion features; speech emotion recognition; vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems, 2009. GCIS '09. WRI Global Congress on
Conference_Location
Xiamen
Print_ISBN
978-0-7695-3571-5
Type
conf
DOI
10.1109/GCIS.2009.432
Filename
5209400
Link To Document