DocumentCode :
1444087
Title :
Human Pose Regression Through Multiview Visual Fusion
Author :
Zhao, Xu ; Fu, Yun ; Ning, Huazhong ; Liu, Yuncai ; Huang, Thomas S.
Author_Institution :
Inst. of Image Process. & Pattern Recognition, Shanghai Jiao Tong Univ., Shanghai, China
Volume :
20
Issue :
7
fYear :
2010
fDate :
7/1/2010 12:00:00 AM
Firstpage :
957
Lastpage :
966
Abstract :
We consider the problem of estimating 3-D human body pose from visual signals within a discriminative framework. It is challenging because there is a wide gap between complex 3-D human motion and planar visual observation, which makes this a severely ill-conditioned problem. In this paper, we focus on three critical factors to tackle human body pose estimation, namely, feature extraction, learning algorithm, and camera utilization. On the feature level, we describe images using the salient interest points represented by scale-invariant feature transform (SIFT)-like descriptors, in which the position, appearance, and local structural information are encoded simultaneously. On the learning algorithm level, we propose to use Gaussian processes and multiple linear (ML) regression to model the mapping between poses and features. Fusing image information from multiple cameras in different views is of great interest to us on the camera level. We make a comprehensive evaluation on the HumanEva database and get two meaningful insights into the three crucial aspects for human pose estimation: 1) although the choice of feature is very important to the problem, once the learning algorithm becomes efficient, the choice of feature is no longer critical, and 2) the impact of information combination from multiple cameras on pose estimation is closely related to not only the quantity of image information, but also its quality. In most cases, it is true that the more information is involved, the better results can be achieved. But when the information quantity is the same, the differences in quality will lead to totally different performance. Furthermore, dense evaluations demonstrate that our approach is an accurate and robust solution to the human body pose estimation problem.
Keywords :
Gaussian processes; computer vision; feature extraction; image fusion; image motion analysis; pose estimation; regression analysis; 3D human motion; Gaussian process; HumanEva database; SIFT-like descriptor; camera utilization; feature extraction; human pose regression; learning algorithm; multiple linear regression; multiview visual fusion; scale-invariant feature transform; Gaussian processes regression; human pose estimation; image feature; multiple views;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2010.2045916
Filename :
5433014
Link To Document :
بازگشت