• DocumentCode
    714182
  • Title

    Automatic emotion recognition using auditory and prosodic indicative features

  • Author

    Gharsellaoui, Soumaya ; Selouani, Sid-Ahmed ; Dahmane, Adel Omar

  • Author_Institution
    Univ. du Quebec a Trois-Rivieres, Trois-Rivieres, QC, Canada
  • fYear
    2015
  • fDate
    3-6 May 2015
  • Firstpage
    1265
  • Lastpage
    1270
  • Abstract
    In this paper, a new framework for the automatic recognition of human emotions from speech was proposed. Besides auditory indicative features, selected prosodic and voice quality parameters were optimally combined with Mel frequency coefficients to perform an automatic emotion classification. For this purpose, the Emotion Prosody Speech and Transcript database, a certified speech corpus, was used throughout this study. An extensive set of experiments have been carried out in order to assess the effectiveness of this original mixture of prosodic, perceptual and auditory features to perform the emotion recognition task. These features were selected by using Linear Discriminant Analysis (LDA) and Principal Component Analysis (PCA) on the basis of their ability of discrimination. The selected features were used by the front-end processing stage of a hybrid Gaussian Mixture Model and Support Vector Machines (GSVMs) to perform the emotion classification. The results showed the effectiveness of the proposed feature extraction framework to discriminate between different human emotions when the LDA-PCA-GSVM classifier was used.
  • Keywords
    Gaussian processes; emotion recognition; feature extraction; mixture models; pattern classification; principal component analysis; set theory; speech recognition; support vector machines; LDA-PCA-GSVM classifier; auditory indicative features; automatic emotion classification; automatic emotion recognition; automatic human emotion recognition; certified speech corpus; emotion prosody speech and transcript database; feature extraction framework; front-end processing stage; hybrid Gaussian mixture model; linear discriminant analysis; mel frequency coefficients; principal component analysis; prosodic indicative features; speech recognition; support vector machines; voice quality parameters; Band-pass filters; Ear; Emotion recognition; Feature extraction; Principal component analysis; Speech; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on
  • Conference_Location
    Halifax, NS
  • ISSN
    0840-7789
  • Print_ISBN
    978-1-4799-5827-6
  • Type

    conf

  • DOI
    10.1109/CCECE.2015.7129460
  • Filename
    7129460