DocumentCode :
1050757
Title :
Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models
Author :
Wang, Xiaogang ; Ma, Xiaoxu ; Grimson, W. Eric L
Author_Institution :
Comput. Sci. & Artificial Intell. Lab., Massachusetts Inst. of Technol., Cambridge, MA
Volume :
31
Issue :
3
fYear :
2009
fDate :
3/1/2009 12:00:00 AM
Firstpage :
539
Lastpage :
555
Abstract :
We propose a novel unsupervised learning framework to model activities and interactions in crowded and complicated scenes. Hierarchical Bayesian models are used to connect three elements in visual surveillance: low-level visual features, simple "atomic" activities, and interactions. Atomic activities are modeled as distributions over low-level visual features, and multi-agent interactions are modeled as distributions over atomic activities. These models are learnt in an unsupervised way. Given a long video sequence, moving pixels are clustered into different atomic activities and short video clips are clustered into different interactions. In this paper, we propose three hierarchical Bayesian models, Latent Dirichlet Allocation (LDA) mixture model, Hierarchical Dirichlet Process (HDP) mixture model, and Dual Hierarchical Dirichlet Processes (Dual-HDP) model. They advance existing language models, such as LDA [1] and HDP [2]. Our data sets are challenging video sequences from crowded traffic scenes and train station scenes with many kinds of activities co-occurring. Without tracking and human labeling effort, our framework completes many challenging visual surveillance tasks of board interest such as: (1) discovering typical atomic activities and interactions; (2) segmenting long video sequences into different interactions; (3) segmenting motions into different activities; (4) detecting abnormality; and (5) supporting high-level queries on activities and interactions.
Keywords :
Bayes methods; image segmentation; learning (artificial intelligence); video surveillance; complicated scenes; crowded traffic scenes; dual hierarchical Dirichlet processes model; hierarchical Bayesian models; hierarchical Dirichlet process mixture model; language models; latent dirichlet allocation mixture model; motion segmentation; multiagent interactions; train station scenes; unsupervised activity perception; unsupervised learning framework; video clips; video sequence; visual features; visual surveillance tasks; Algorithms; Applications; Artificial Intelligence; Clustering; Computer vision; Computing Methodologies; Machine learning; Motion; Pattern Recognition; Statistical; Video analysis; Vision and Scene Understanding; Algorithms; Artificial Intelligence; Bayes Theorem; Computer Simulation; Humans; Image Enhancement; Image Interpretation, Computer-Assisted; Models, Biological; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique; Whole Body Imaging;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.87
Filename :
4731265
Link To Document :
بازگشت