Regional Information Center for Science and Technology (RICeST) - Speech emotion classification via a modified Gaussian Mixture Model approach

DocumentCode :

1787073

Title :

Speech emotion classification via a modified Gaussian Mixture Model approach

Author :

Hosseini, Zeinab ; Ahadi, Seyed Mohammad ; Faraji, Neda

Author_Institution :

Speech Process. Res. Lab., Amirkabir Univ. of Technol., Tehran, Iran

fYear :

2014

fDate :

9-11 Sept. 2014

Firstpage :

487

Lastpage :

491

Abstract :

The emotional state of a speaker is an important feature embedded in the speech signal he or she produces. Although emotion recognition can improve the performance of systems such as ASR, little research has been carried out in the field of speech emotion classification. This paper focuses on finding more effective approaches to classifying the speaker's emotional state. Two approaches are proposed, one for the training phase and one for the test phase, with the Gaussian Mixture Model (GMM) as the classifier. The motivation in both is to reduce the confusing regions of the emotional speech space and to increase the salience of the discriminative regions. In the training phase, the symmetric Kullback-Leibler Divergence (KLD) is used as a measure to detect the discriminative GMM mixtures, while the confusing mixtures are ignored; this algorithm is called KLD-GMM. In the test phase, the discriminative frames are identified by Frame Selection Decoding (FSD); this algorithm is called FSD-GMM. When FSD is applied to KLD-GMM, the combined approach is called KLD-FSD-GMM. The two proposed algorithms yield an average absolute improvement of about 7% in emotion recognition performance over the baseline generalized GMM-based method.
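The abstract names the symmetric Kullback-Leibler Divergence as the measure of how discriminative a mixture is, but does not give the formula or the selection rule. As a rough illustration only, the sketch below computes the standard closed-form symmetric KLD between two diagonal-covariance Gaussian components. The function names, the diagonal-covariance assumption, and the pure-Python list representation are assumptions made for this example and are not taken from the paper.

```python
import math


def kl_diag_gauss(mu_p, var_p, mu_q, var_q):
    """KL(p || q) for two diagonal-covariance Gaussians.

    Each argument is a list of per-dimension means or variances.
    (Hypothetical helper; the paper's own formulation is not shown
    in the abstract.)
    """
    total = 0.0
    for mp, vp, mq, vq in zip(mu_p, var_p, mu_q, var_q):
        total += math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0
    return 0.5 * total


def symmetric_kl(mu_p, var_p, mu_q, var_q):
    """Symmetric KLD: KL(p || q) + KL(q || p)."""
    return (kl_diag_gauss(mu_p, var_p, mu_q, var_q)
            + kl_diag_gauss(mu_q, var_q, mu_p, var_p))


if __name__ == "__main__":
    # Two 2-dimensional components with unit variances and shifted means.
    print(symmetric_kl([0.0, 0.0], [1.0, 1.0], [1.0, 0.0], [1.0, 1.0]))
```

Under this reading, a component whose symmetric KLD to the components of other emotion models is small would count as "confusing", and one with a large divergence as "discriminative"; how the paper thresholds or ranks the mixtures is not stated in the abstract.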

Keywords :

Gaussian processes; emotion recognition; mixture models; speech recognition; Gaussian mixture model; KLD-FSD-GMM algorithm; discriminative GMM mixtures; discriminative frames; frame selection decoding; modified Gaussian mixture model approach; speech emotion classification; speech signal; symmetric Kullback-Leibler divergence; system performance improvement; Classification algorithms; Emotion recognition; Feature extraction; Speech; Speech processing; Speech recognition; Training; Berlin emotional database (EMO-DB); GMM; KLD; emotion classification; frame selection

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Telecommunications (IST), 2014 7th International Symposium on

Conference_Location :

Tehran

Print_ISBN :

978-1-4799-5358-5

Type :

conf

DOI :

10.1109/ISTEL.2014.7000752

Filename :

7000752

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1787073