• DocumentCode
    2181430
  • Title

    Iterative feature normalization for emotional speech detection

  • Author

    Busso, Carlos ; Metallinou, Angeliki ; Narayanan, Shrikanth S.

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Texas at Dallas, Dallas, TX, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5692
  • Lastpage
    5695
  • Abstract
    Contending with signal variability due to source and channel effects is a critical problem in automatic emotion recognition. Any approach in mitigating these effects however has to be done so as to not compromise emotion-relevant information in the signal. A promising approach to this problem has been through feature normalization using features drawn from non-emotional ("neutral") speech samples. This paper considers a scheme for minimizing the inter-speaker differences while still preserving the emotional discrimination of the acoustic features. This can be achieved by estimating the normalization parameters using only neutral speech, and then applying the coefficients to the entire corpus (including emotional set). Specifically, this paper introduces a feature normalization scheme that implements these ideas by iteratively detecting neutral speech and normalizing the features. As the approximation error of the normalization parameters is reduced, the accuracy of the emotion detection system increases. The accuracy of the proposed iterative approach, evaluated across three databases, is only 2.5% lower than the one trained with optimal normalization parameters, and 9.7% higher than the one trained without any normalization scheme.
  • Keywords
    emotion recognition; speech recognition; acoustic features; automatic emotion recognition; emotional neutral speech sample; emotional speech detection; iterative feature normalization; Accuracy; Acoustics; Databases; Emotion recognition; Feature extraction; Hidden Markov models; Speech; emotion recognition; emotions; feature normalization; fundamental frequency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947652
  • Filename
    5947652