DocumentCode :
3020909
Title :
Inferring social roles in long timespan video sequence
Author :
Zhang, Jiangen ; Hu, Wenze ; Yao, Benjamin ; Wang, Yongtian ; Zhu, Song-Chun
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing, China
fYear :
2011
fDate :
6-13 Nov. 2011
Firstpage :
1456
Lastpage :
1463
Abstract :
In this paper, we present a method for inferring social roles of agents (persons) from their daily activities in long surveillance video sequences. We define activities as interactions between an agent´s position and semantic hotspots within the scene. Given a surveillance video, our method first tracks the locations of agents then automatically discovers semantic hotspots in the scene. By enumerating spatial/temporal locations between an agent´s feet and hotspots in a scene, we define a set of atomic actions, which in turn compose sub-events and events. The numbers and types of events performed by an agent are assumed to be driven by his/her social role. With the grammar model induced by composition rules, an adapted Earley parser algorithm is used to parse the trajectories into events, sub-events and atomic actions. With probabilistic output of events, the roles of agents can be predicted under the Bayesian inference framework. Experiments are carried out on a challenging 8.5 hours video from a surveillance camera in the lobby of a research lab. The video contains 7 different social roles including “manager”, “researcher”, “developer”, “engineer”, “staff”, “visitor” and “mailman”. Results show that our proposed method can predict the role of each agent with high precision.
Keywords :
Bayes methods; grammars; image sequences; social sciences computing; video signal processing; video surveillance; Bayesian inference; Earley parser algorithm; atomic action; composition rule; grammar model; long timespan video sequence; semantic hotspot; social role; spatial location enumeration; surveillance video; temporal location enumeration; Atomic layer deposition; Grammar; Hidden Markov models; Semantics; Surveillance; Trajectory; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4673-0062-9
Type :
conf
DOI :
10.1109/ICCVW.2011.6130422
Filename :
6130422
Link To Document :
بازگشت