DocumentCode :
1062192
Title :
Human Pose Tracking in Monocular Sequence Using Multilevel Structured Models
Author :
Lee, Mun Wai ; Nevatia, Ramakant
Author_Institution :
ObjectVideo Inc., Reston, VA
Volume :
31
Issue :
1
fYear :
2009
Firstpage :
27
Lastpage :
38
Abstract :
Tracking human body poses in monocular video has many important applications. The problem is challenging in realistic scenes due to background clutter, variation in human appearance and self-occlusion. The complexity of pose tracking is further increased when there are multiple people whose bodies may inter-occlude. We proposed a three-stage approach with multi-level state representation that enables a hierarchical estimation of 3D body poses. Our method addresses various issues including automatic initialization, data association, self and inter-occlusion. At the first stage, humans are tracked as foreground blobs and their positions and sizes are coarsely estimated. In the second stage, parts such as face, shoulders and limbs are detected using various cues and the results are combined by a grid-based belief propagation algorithm to infer 2D joint positions. The derived belief maps are used as proposal functions in the third stage to infer the 3D pose using data-driven Markov chain Monte Carlo. Experimental results on several realistic indoor video sequences show that the method is able to track multiple persons during complex movement including sitting and turning movements with self and inter-occlusion.
Keywords :
Markov processes; Monte Carlo methods; image representation; image sequences; pose estimation; sensor fusion; tracking; video signal processing; 2D joint positions; 3D body poses; automatic initialization; belief maps; data association; data-driven Markov chain Monte Carlo; grid-based belief propagation algorithm; human pose tracking; monocular sequence; monocular video; multilevel state representation; multilevel structured models; realistic indoor video sequences; realistic scenes; Computer vision; Image Processing and Computer Vision; Computer Simulation; Humans; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Models, Biological; Posture; Video Recording; Whole Body Imaging;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.35
Filename :
4447674
Link To Document :
بازگشت