DocumentCode
2806240
Title
Speech Emotion Recognition Using Both Spectral and Prosodic Features
Author
Zhou, Yu ; Sun, Yanqing ; Zhang, Jianping ; Yan, Yonghong
Author_Institution
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
fYear
2009
fDate
19-20 Dec. 2009
Firstpage
1
Lastpage
4
Abstract
In this paper, we propose a speech emotion recognition system using both spectral and prosodic features. Most traditional systems have focused on spectral features or prosodic features. Since both the spectral and the prosodic features contain emotion information, it is believed that the combining of spectral features and prosodic features will improve the performance of the emotion recognition system. Therefore, we propose to use both spectral and prosodic features. For spectral features, a GMM super vector based SVM is applied with them. For prosodic features, a set of prosodic features that are clearly correlated with speech emotional states and SVM is also used for emotion recognition. The combination of both spectral features and prosodic features is posed as a data fusion problem to obtain the final decision. Experimental results show that the combining of both spectral features and prosodic features yields the emotion error reduction rate of 18.0% and 52.8%, over using only spectral and prosodic features.
Keywords
Gaussian processes; emotion recognition; speech recognition; support vector machines; GMM super vector; Gaussian mixture model; SVM; prosodic features; spectral features; speech emotion recognition; support vector machines; Acoustics; Cepstral analysis; Emotion recognition; Mel frequency cepstral coefficient; Spatial databases; Speech; Sun; Support vector machine classification; Support vector machines; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-4994-1
Type
conf
DOI
10.1109/ICIECS.2009.5362730
Filename
5362730
Link To Document