DocumentCode
134327
Title
Speech emotion recognition based on wavelet packet coefficient model
Author
Kunxia Wang ; Ning An ; Lian Li
Author_Institution
Gerontechnology Lab., Hefei Univ. of Technol., Hefei, China
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
478
Lastpage
482
Abstract
Conventional features have achieved good performance in speech emotion recognition. However, these features are based on short-time analysis without considering the non-stationary properties. In this paper we focus on wavelet packet techniques, which can provide an improved signal representation with a tradeoff between time and frequency resolution. We propose a wavelet packet coefficient model in speech emotion recognition. The wavelet packet coefficients at five decomposition levels are analyzed and used as input features to Support Vector Machine (SVM) classifiers. The performances of these features are evaluated for seven emotional states in two languages, German and Chinese. Results demonstrate that these wavelet packet coefficients features show improvement in emotion recognition performance compared with conventional Mel-Frequency Cepstral Coefficients (MFCC) features.
Keywords
emotion recognition; signal classification; speech recognition; support vector machines; wavelet transforms; Chinese language; German language; MFCC features; Mel-frequency cepstral coefficients; SVM classifier; decomposition level; frequency resolution; short-time analysis; signal representation; speech emotion recognition; support vector machine; time resolution; wavelet packet coefficient model; wavelet packet techniques; Databases; Emotion recognition; Speech; Speech recognition; Wavelet analysis; Wavelet packets; speech emotion recognition; wavelet packet; wavelet packet coefficient;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936710
Filename
6936710
Link To Document