• DocumentCode
    2008967
  • Title

    Speech Emotion Recognition Using Canonical Correlation Analysis and Probabilistic Neural Network

  • Author

    Cen, Ling ; Ser, Wee ; Yu, Zhu Liang

  • Author_Institution
    A *STAR, Inst. for Infocomm Res., Singapore, Singapore
  • fYear
    2008
  • fDate
    11-13 Dec. 2008
  • Firstpage
    859
  • Lastpage
    862
  • Abstract
    In this paper, automatic identification of emotional states from human speech is addressed. While several papers have been published in the literature on speech emotion recognition, the features used are taken or modified from those used for speech recognition purposes. However, not all features used for speech recognition are of equal importance for emotion recognition. This paper addresses this issue and proposes a systematic method on feature selection for emotion recognition from speech signals. The idea is to work on a well-selected small feature set and use it to remove irrelevant information. Specifically, the proposed method uses the similar idea of the Canonical Correlation Analysis (CCA) to estimate the linear relationship between the various features and the emotional states. The outcome is a set of features that are of most relevance to the emotions. Experiments have been conducted using the LDC database and with the use of the Probabilistic Neural Network (PNN) as the classification method. The results obtained show that, comparable accuracies can be obtained for the emotional states tested with the use of only about 30% of the features considered. This implies that the computational load can be reduced greatly too.
  • Keywords
    correlation methods; emotion recognition; feature extraction; neural nets; probability; speech processing; speech recognition; statistical analysis; automatic emotional state identification; canonical correlation analysis; feature selection; linear relationship estimation; probabilistic neural network; speech emotion recognition; speech signal; Emotion recognition; Feature extraction; Humans; Machine learning; Neural networks; Spatial databases; Speech analysis; Speech processing; Speech recognition; State estimation; Emotion; Feature selection; Speech; classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-0-7695-3495-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2008.85
  • Filename
    4725081