• DocumentCode
    681693
  • Title

    Detection of affective states from speech signals using ensembles of classifiers

  • Author

    Kobayashi, V.B. ; Calag, V.B.

  • Author_Institution
    Univ. of the Philippines Mindanao, Davao, Philippines
  • fYear
    2013
  • fDate
    2-3 Dec. 2013
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    Recently, the focus of investigation has been the design of classifier systems that achieve optimal classification accuracy in detecting affective states from speech signals. Previous works have shown the inadequacy of single classifier models to deal with this problem. In this work we propose to use ensemble learning techniques such as random forest and kernel factory as classifiers in the design of a speech emotion recognition system. The system proceeds from speech signal pre-processing, feature extraction, construction of classifiers, and finally to the prediction of emotion. Features related to fundamental frequency, energy, mel-frequency cepstrum coefficients and linear predictive cepstrum coefficients were extracted from the individual segments. Subsequently, we trained two ensembles classifiers, namely, random forest and kernel factory. We have tested our approach on a number of speech databases. The results showed that ensemble classifiers yielded superior classification performance compared to single models by at most 20% increase. We also found out that our results exceeded the results of existing studies. We concluded that ensemble classifiers are effective for the identification of emotions, hence, suitable for this domain.
  • Keywords
    emotion recognition; feature extraction; learning (artificial intelligence); pattern classification; speech processing; speech recognition; affective states; classifier systems; classifiers construction; emotion prediction; ensemble learning techniques; feature extraction; fundamental frequency; kernel factory; linear predictive cepstrum coefficients; mel-frequency cepstrum coefficients; random forest; speech emotion recognition system; speech signal preprocessing; speech signals; Emotional Speech Recognition; Ensemble Learning; Kernel Factory; Random Forest;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Intelligent Signal Processing Conference 2013 (ISP 2013), IET
  • Conference_Location
    London
  • Electronic_ISBN
    978-1-84919-774-8
  • Type

    conf

  • DOI
    10.1049/cp.2013.2067
  • Filename
    6740516