DocumentCode :
2291762
Title :
Video Scene Understanding Using Multi-scale Analysis
Author :
Yang, Yang ; Liu, Jingen ; Shah, Mubarak
Author_Institution :
Comput. Vision Lab., Univ. of Central Florida, Orlando, FL, USA
fYear :
2009
fDate :
Sept. 29 2009-Oct. 2 2009
Firstpage :
1669
Lastpage :
1676
Abstract :
We propose a novel method for automatically discovering key motion patterns happening in a scene by observing the scene for an extended period. Our method does not rely on object detection and tracking, and uses low level features, the direction of pixel wise optical flow. We first divide the video into clips and estimate a sequence of flow-fields. Each moving pixel is quantized based on its location and motion direction. This is essentially a bag of words representation of clips. Once a bag of words representation is obtained, we proceed to the screening stage, using a measure called the `conditional entropy´. After obtaining useful words we apply Diffusion maps. Diffusion maps framework embeds the manifold points into a lower dimensional space while preserving the intrinsic local geometric structure. Finally, these useful words in lower dimensional space are clustered to discover key motion patterns. Diffusion map embedding involves diffusion time parameter which gives us ability to detect key motion patterns at different scales using multi-scale analysis. In addition, clips which are represented in terms of frequency of motion patterns can also be clustered to determine multiple dominant motion patterns which occur simultaneously, providing us further understanding of the scene. We have tested our approach on two challenging datasets and obtained interesting and promising results.
Keywords :
entropy; image motion analysis; image sequences; object detection; bag of words representation; conditional entropy; diffusion maps; diffusion time parameter; geometric structure; multiple dominant motion patterns; multiscale analysis; object detection; object tracking; pixelwise optical flow; video scene understanding; Cameras; Computer vision; Layout; Motion analysis; Motion detection; Object detection; Pattern analysis; Roads; Traffic control; Unmanned aerial vehicles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location :
Kyoto
ISSN :
1550-5499
Print_ISBN :
978-1-4244-4420-5
Electronic_ISBN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2009.5459376
Filename :
5459376
Link To Document :
بازگشت