Title :
Speech emotion classification via a modified Gaussian Mixture Model approach
Author :
Hosseini, Zeinab ; Ahadi, Seyed Mohammad ; Faraji, Neda
Author_Institution :
Speech Process. Res. Lab., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
The emotional state of a speaker is an important feature embedded in the speech signal he or she produces. Despite the importance of emotion recognition for improving system performance, for example in ASR, relatively little research has been carried out in the field of speech emotion classification. This paper focuses on finding more effective approaches to classifying a speaker's emotional state. Two approaches are proposed, one for the training phase and one for the test phase, with the Gaussian Mixture Model (GMM) as the classifier. The motivation in both is to reduce the influence of confusing regions of the emotional speech space and to increase the salience of the discriminative regions. In the training phase, the symmetric Kullback-Leibler Divergence (KLD) is used as a measure to detect the discriminative GMM mixtures, while the confusing mixtures are ignored. This algorithm is known as KLD-GMM. In the test phase, the discriminative frames are identified by Frame Selection Decoding (FSD). This algorithm is known as FSD-GMM. When the FSD algorithm is applied to the KLD-GMM algorithm, the resulting approach is named KLD-FSD-GMM. The two proposed algorithms yield an average absolute improvement of about 7% in emotion recognition performance compared with the baseline generalized GMM-based method.
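As a rough illustration of the training-phase idea, the sketch below computes the closed-form symmetric KLD between two diagonal-covariance Gaussian components and uses it to flag components that lie far from every component of a competing emotion model. The abstract does not state the selection rule, the covariance type, or any threshold, so the nearest-component threshold test, the diagonal covariances, and the names `kl_diag_gauss`, `symmetric_kld`, and `select_discriminative` are all assumptions made for illustration, not the authors' implementation.

```python
import math


def kl_diag_gauss(mu_p, var_p, mu_q, var_q):
    """KL(p || q) for two Gaussians with diagonal covariance (closed form)."""
    total = 0.0
    for mp, vp, mq, vq in zip(mu_p, var_p, mu_q, var_q):
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total


def symmetric_kld(mu_p, var_p, mu_q, var_q):
    """Symmetric KLD: KL(p || q) + KL(q || p)."""
    return (kl_diag_gauss(mu_p, var_p, mu_q, var_q)
            + kl_diag_gauss(mu_q, var_q, mu_p, var_p))


def select_discriminative(own, other, threshold):
    """Return indices of components in `own` that are far from all of `other`.

    `own` and `other` are lists of (mean, variance) pairs, one per mixture
    component. ASSUMPTION: a component counts as discriminative when its
    smallest symmetric KLD to any component of the other model exceeds
    `threshold`; the paper's actual criterion may differ.
    """
    kept = []
    for idx, (mu, var) in enumerate(own):
        nearest = min(symmetric_kld(mu, var, mu_o, var_o)
                      for mu_o, var_o in other)
        if nearest > threshold:
            kept.append(idx)
    return kept


if __name__ == "__main__":
    # Hypothetical one-dimensional components for two emotion models.
    anger = [([0.0], [1.0]), ([5.0], [1.0])]
    sadness = [([0.1], [1.0])]
    print(select_discriminative(anger, sadness, threshold=1.0))
```

In this toy setup the first component of the hypothetical `anger` model nearly coincides with the `sadness` component and would be treated as confusing, while the second is well separated and would be retained.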
Keywords :
Gaussian processes; emotion recognition; mixture models; speech recognition; Gaussian mixture model; KLD-FSD-GMM algorithm; discriminative GMM mixtures; discriminative frames; frame selection decoding; modified Gaussian mixture model approach; speech emotion classification; speech signal; symmetric Kullback-Leibler divergence; system performance improvement; Classification algorithms; Emotion recognition; Feature extraction; Speech; Speech processing; Speech recognition; Training; Berlin emotional database (EMO-DB); GMM; KLD; emotion classification; frame selection;
Conference_Title :
Telecommunications (IST), 2014 7th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-5358-5
DOI :
10.1109/ISTEL.2014.7000752