Title :
CASSANDRA: audio-video sensor fusion for aggression detection
Author :
Zajde, W. ; Krijnders, J.D. ; Andringa, T. ; Gavrila, D.M.
Author_Institution :
Univ. of Amsterdam, Amsterdam
Abstract :
This paper presents a smart surveillance system named CASSANDRA, aimed at detecting instances of aggressive human behavior in public environments. A distinguishing aspect of CASSANDRA is the exploitation of the complimentary nature of audio and video sensing to disambiguate scene activity in real-life, noisy and dynamic environments. At the lower level, independent analysis of the audio and video streams yields intermediate descriptors of a scene like: "scream", "passing train" or "articulation energy". At the higher level, a Dynamic Bayesian Network is used as a fusion mechanism that produces an aggregate aggression indication for the current scene. Our prototype system is validated on a set of scenarios performed by professional actors at an actual train station to ensure a realistic audio and video noise setting.
Keywords :
audio signal processing; gesture recognition; sensor fusion; video signal processing; video surveillance; CASSANDRA smart surveillance system; aggressive human behavior detection; articulation energy; audio streams; audio-video sensor fusion; dynamic Bayesian network; dynamic environments; human activity recognition; passing train; public environments; scream-like cues; video streams; Aggregates; Artificial intelligence; Bayesian methods; Frequency; Humans; Kinetic energy; Layout; Sensor fusion; Speech; Surveillance;
Conference_Titel :
Advanced Video and Signal Based Surveillance, 2007. AVSS 2007. IEEE Conference on
Conference_Location :
London
Print_ISBN :
978-1-4244-1696-7
Electronic_ISBN :
978-1-4244-1696-7
DOI :
10.1109/AVSS.2007.4425310