• DocumentCode
    3703305
  • Title

    Cross-corpus analysis for acoustic recognition of negative interactions

  • Author

    Iulia Lefter;Harold T. Nefs;Catholijn M. Jonker;Leon J. M. Rothkrantz

  • Author_Institution
    Delft University of Technology, Delft, The Netherlands
  • fYear
    2015
  • Firstpage
    132
  • Lastpage
    138
  • Abstract
    Recent years have witnessed a growing interest in recognizing emotions and events based on speech. One of the applications of such systems is automatically detecting when a situations gets out of hand and human intervention is needed. Most studies have focused on increasing recognition accuracies using parts of the same dataset for training and testing. However, this says little about how such a trained system is expected to perform `in the wild´. In this paper we present a cross-corpus study using the audio part of three multimodal datasets containing negative human-human interactions. We present intra- and cross-corpus accuracies whilst manipulating the acoustic features, normalization schemes, and oversampling of the least represented class to alleviate the negative effects of data unbalance. We observe a decrease in performance when disjunct corpora are used for training and testing. Merging two datasets for training results in a slightly lower performance than the best one obtained by using only one corpus for training. A hand crafted low dimensional feature set shows competitive behavior when compared to a brute force high dimensional features vector. Corpus normalization and artificially creating samples of the sparsest class have a positive effect.
  • Keywords
    "Speech recognition","Speech","Training","Emotion recognition","Surveillance","Acoustics","Stress"
  • Publisher
    ieee
  • Conference_Titel
    Affective Computing and Intelligent Interaction (ACII), 2015 International Conference on
  • Electronic_ISBN
    2156-8111
  • Type

    conf

  • DOI
    10.1109/ACII.2015.7344562
  • Filename
    7344562