DocumentCode :
567643
Title :
Learning the fusion of audio and video aggression assessment by meta-information from human annotations
Author :
Lefter, Iulia ; Burghouts, Gertjan J. ; Rothkrantz, Leon J M
Author_Institution :
Delft Univ. of Technol., Delft, Netherlands
fYear :
2012
fDate :
9-12 July 2012
Firstpage :
1527
Lastpage :
1533
Abstract :
The focus of this paper is finding a method to predict aggression using a multimodal system, given multiple unimodal features. The mechanism underlying multimodal sensor fusion is complex and not completely clear. We try to understand the process of fusion and make it more transparent. As a case study we use a database with audio-visual recordings of aggressive behavior in trains. We have collected multi- and unimodal assessments by humans, who have given aggression scores on a 3 point scale. There are no trivial fusion steps to predict the multimodal labels from the unimodal labels. We propose an intermediate step to discover the structure in the fusion process. We call these meta-features and we find a set of five which have an impact on the fusion process. Using a propositional rule based learner we show the high positive impact of the meta-features on predicting the multimodal label for the complex situations in which the labels for audio, video and multimodal do not reinforce each other. We continue with an experiment by which we prove the added value of such an approach on the whole data set.
Keywords :
audio-visual systems; data structures; learning (artificial intelligence); prediction theory; sensor fusion; video signal processing; aggression scores; aggressive behavior; audio aggression assessment fusion; audio-visual recordings; complex situations; fusion process structure; human annotations; meta information; multimodal labels prediction; multimodal sensor fusion; multiple unimodal features; propositional rule-based learner; video aggression assessment fusion; Conductors; Context; Databases; History; Humans; Semantics; Streaming media; aggression; meta-features; multimodal fusion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Fusion (FUSION), 2012 15th International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4673-0417-7
Electronic_ISBN :
978-0-9824438-4-2
Type :
conf
Filename :
6290490
Link To Document :
بازگشت