Title :
Stress and emotion recognition using log-Gabor filter analysis of speech spectrograms
Author :
He, Ling ; Lech, Margaret ; Maddage, Namunu ; Allen, Nicholas
Author_Institution :
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
Abstract :
We present new methods that extract characteristic features from speech magnitude spectrograms. Two of the presented approaches have been found particularly efficient in the process of automatic stress and emotion classification. In the first approach, the spectrograms are sub-divided into ERB frequency bands and the average energy for each band is calculated. In the second approach, the spectrograms are passed through a bank of 12 log-Gabor filters and the outputs are averaged and passed through an optimal feature selection procedure based on mutual information criteria. The proposed methods were tested using single vowels, words and sentences from SUSAS data base with 3 classes of stress, and spontaneous speech recordings made by psychologists (ORI) with 5 emotional classes. The classification results based on the Gaussian mixture model show correct classification rates of 40%-81%, for different SUSAS data sets and 40%-53.4% for the ORI data base.
Keywords :
Gabor filters; Gaussian processes; emotion recognition; Gaussian mixture model; emotion classification; emotion recognition; log-Gabor filter analysis; speech magnitude spectrograms; stress recognition; Band pass filters; Emotion recognition; Feature extraction; Frequency; Information filtering; Information filters; Mutual information; Spectrogram; Speech analysis; Stress;
Conference_Titel :
Affective Computing and Intelligent Interaction and Workshops, 2009. ACII 2009. 3rd International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4244-4800-5
Electronic_ISBN :
978-1-4244-4799-2
DOI :
10.1109/ACII.2009.5349454