DocumentCode
3703305
Title
Cross-corpus analysis for acoustic recognition of negative interactions
Author
Iulia Lefter;Harold T. Nefs;Catholijn M. Jonker;Leon J. M. Rothkrantz
Author_Institution
Delft University of Technology, Delft, The Netherlands
fYear
2015
Firstpage
132
Lastpage
138
Abstract
Recent years have witnessed a growing interest in recognizing emotions and events based on speech. One of the applications of such systems is automatically detecting when a situations gets out of hand and human intervention is needed. Most studies have focused on increasing recognition accuracies using parts of the same dataset for training and testing. However, this says little about how such a trained system is expected to perform `in the wild´. In this paper we present a cross-corpus study using the audio part of three multimodal datasets containing negative human-human interactions. We present intra- and cross-corpus accuracies whilst manipulating the acoustic features, normalization schemes, and oversampling of the least represented class to alleviate the negative effects of data unbalance. We observe a decrease in performance when disjunct corpora are used for training and testing. Merging two datasets for training results in a slightly lower performance than the best one obtained by using only one corpus for training. A hand crafted low dimensional feature set shows competitive behavior when compared to a brute force high dimensional features vector. Corpus normalization and artificially creating samples of the sparsest class have a positive effect.
Keywords
"Speech recognition","Speech","Training","Emotion recognition","Surveillance","Acoustics","Stress"
Publisher
ieee
Conference_Titel
Affective Computing and Intelligent Interaction (ACII), 2015 International Conference on
Electronic_ISBN
2156-8111
Type
conf
DOI
10.1109/ACII.2015.7344562
Filename
7344562
Link To Document