Title :
Speech Emotion Recognition Using Fourier Parameters
Author :
Kunxia Wang ; Ning An ; Bing Nan Li ; Yanyong Zhang ; Lian Li
Author_Institution :
Dept. of Electron. Eng., Hefei Univ. of Technol., Hefei, China
fDate :
Jan.-March 1 2015
Abstract :
Recently, studies have been performed on harmony features for speech emotion recognition. It is found in our study that the first- and second-order differences of harmony features also play an important role in speech emotion recognition. Therefore, we propose a new Fourier parameter model using the perceptual content of voice quality and the first- and second-order differences for speaker-independent speech emotion recognition. Experimental results show that the proposed Fourier parameter (FP) features are effective in identifying various emotional states in speech signals. They improve the recognition rates over the methods using Mel frequency cepstral coefficient (MFCC) features by 16.2, 6.8 and 16.6 points on the German database (EMODB), Chinese language database (CASIA) and Chinese elderly emotion database (EESDB). In particular, when combining FP with MFCC, the recognition rates can be further improved on the aforementioned databases by 17.5, 10 and 10.5 points, respectively.
Keywords :
Fourier analysis; emotion recognition; speaker recognition; CASIA; Chinese elderly emotion database; Chinese language database; EESDB; EMODB; FP features; Fourier parameter model; German database; MFCC features; Mel frequency cepstral coefficient; emotional states; harmony features; speaker-independent speech emotion recognition; speech signals; Databases; Emotion recognition; Feature extraction; Harmonic analysis; Mel frequency cepstral coefficient; Speech; Speech recognition; Fourier parameter model; affective computing; speaker-independent; speech emotion recognition;
Journal_Title :
Affective Computing, IEEE Transactions on
DOI :
10.1109/TAFFC.2015.2392101