• DocumentCode
    1668757
  • Title

    Identifying salient sub-utterance emotion dynamics using flexible units and estimates of affective flow

  • Author

    Provost, Emily Mower

  • Author_Institution
    Comput. Sci. & Eng., Univ. of Michigan, Ann Arbor, MI, USA
  • fYear
    2013
  • Firstpage
    3682
  • Lastpage
    3686
  • Abstract
    Emotion recognition is the process of identifying the affective characteristics of an utterance given either static or dynamic descriptions of its signal content. This requires the use of units, windows over which the emotion variation is quantified. However, the appropriate time scale for these units is still an open question. Traditionally, emotion recognition systems have relied upon units of fixed length, whose variation is then modeled over time. This paper takes the view that emotion is expressed over units of variable length. In this paper, variable-length units are introduced and used to capture the local dynamics of emotion at the sub-utterance scale. The results demonstrate that subsets of these local dynamics are salient with respect to emotion class. These salient units provide insight into the natural variation in emotional speech and can be used in a classification framework to achieve performance comparable to the state-of-the-art. This hints at the existence of building blocks that may underlie natural human emotional communication.
  • Keywords
    emotion recognition; signal classification; speech recognition; affective characteristics; affective flow estimation; classification framework; dynamic descriptions; emotion class; emotion recognition; emotion variation; emotional speech; flexible units; local dynamics; natural human emotional communication; natural variation; salient subutterance emotion dynamics; salient units; signal content; static descriptions; subutterance scale; variable-length units; Accuracy; Databases; Emotion recognition; Hidden Markov models; Speech; Training data; Trajectory; Emotion classification; emotion profile; emotion representation; emotion unit; emotogram;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638345
  • Filename
    6638345