• DocumentCode
    1787073
  • Title
    Speech emotion classification via a modified Gaussian Mixture Model approach
  • Author
    Hosseini, Zeinab ; Ahadi, Seyed Mohammad ; Faraji, Neda
  • Author_Institution
    Speech Process. Res. Lab., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2014
  • fDate
    9-11 Sept. 2014
  • Firstpage
    487
  • Lastpage
    491
  • Abstract
    The emotional state of a speaker is an important feature embedded in the speech signal he or she produces. Despite the importance of emotion recognition for improving system performance, for example in automatic speech recognition (ASR), relatively little research has been carried out in speech emotion classification. This paper focuses on finding more effective approaches to classifying the speaker's emotional state. Two approaches are proposed, one for the training phase and one for the test phase, with the Gaussian Mixture Model (GMM) as the classifier. The motivation in both is to reduce the confusing regions of the emotional speech space and to increase the salience of the discriminative regions. In the training phase, the symmetric Kullback-Leibler Divergence (KLD) is used as a measure to detect the discriminative GMM mixtures, while the confusing mixtures are ignored; this algorithm is called KLD-GMM. In the test phase, the discriminative frames are identified by Frame Selection Decoding (FSD); this algorithm is called FSD-GMM. When FSD is applied on top of KLD-GMM, the approach is called KLD-FSD-GMM. The two proposed algorithms yield an average absolute improvement of about 7% in emotion recognition performance over the baseline generalized GMM-based method.
  • Keywords
    Gaussian processes; emotion recognition; mixture models; speech recognition; Gaussian mixture model; KLD-FSD-GMM algorithm; discriminative GMM mixtures; discriminative frames; frame selection decoding; modified gaussian mixture model approach; speech emotion classification; speech signal; symmetric Kullback-Leibler divergence; system performance improvement; Classification algorithms; Emotion recognition; Feature extraction; Speech; Speech processing; Speech recognition; Training; Berlin emotional database (EMO-DB); GMM; KLD; emotion classification; frame selection;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Telecommunications (IST), 2014 7th International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4799-5358-5
  • Type
    conf
  • DOI
    10.1109/ISTEL.2014.7000752
  • Filename
    7000752