• DocumentCode
    2550245
  • Title

    Speech Emotion Classification Using Machine Learning Algorithms

  • Author

    Casale, S. ; Russo, A. ; Scebba, G. ; Serrano, S.

  • Author_Institution
    Dipt. di Ing. Inf. e delle Telecomun. Fac. di Ing., Univ. di Catania, Catania
  • fYear
    2008
  • fDate
    4-7 Aug. 2008
  • Firstpage
    158
  • Lastpage
    165
  • Abstract
    The recognition of emotional states is a relatively new technique in the field of machine learning. The paper presents the study and the performance results of a system for emotion classification using the architecture of a distributed speech recognition system (DSR). The features used were extracted by the front-end ETSI Aurora eXtended of a mobile terminal in compliance with the ETSI ES 202-211 V1.1.1 standard. On the basis of the time trend of these parameters, over 3800 statistical parameters were extracted to characterize semantic units of varying length (sentences and words). Using the WEKA (Waikato Environment for Knowledge Analysis) software the most significant parameters for the classification of emotional states were selected and the results of various classification techniques were analysed. The results, obtained using both the Berlin Database of Emotional Speech (EMO-DB) and the Speech Under Simulated and Actual Stress (SUSAS) corpus, showed that the best performance is achieved using a support vector machine (SVM) trained with the sequential minimal optimization (SMO) algorithm, after normalizing and discretizing the input statistical parameters.
  • Keywords
    learning (artificial intelligence); optimisation; pattern classification; speech recognition; support vector machines; Berlin Database of Emotional Speech; SVM; Speech Under Simulated and Actual Stress; WEKA; Waikato Environment for Knowledge Analysis; distributed speech recognition system; machine learning algorithm; sentences; sequential minimal optimization algorithm; speech emotion classification; support vector machine; words; Databases; Emotion recognition; Feature extraction; Machine learning; Machine learning algorithms; Speech recognition; Stress; Support vector machine classification; Support vector machines; Telecommunication standards; ETSI ES 202-211 standard; emo-db; emotion classification; machine learning; speech analysis; susas;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing, 2008 IEEE International Conference on
  • Conference_Location
    Santa Clara, CA
  • Print_ISBN
    978-0-7695-3279-0
  • Electronic_ISBN
    978-0-7695-3279-0
  • Type

    conf

  • DOI
    10.1109/ICSC.2008.43
  • Filename
    4597187