DocumentCode
3210071
Title
3D human pose from silhouettes by relevance vector regression
Author
Agarwal, Ankur ; Triggs, Bill
Author_Institution
GRAVIR-INRIA-CNRS, Montbonnot, France
Volume
2
fYear
2004
fDate
27 June-2 July 2004
Abstract
We describe a learning based method for recovering 3D human body pose from single images and monocular image sequences. Our approach requires neither an explicit body model nor prior labelling of body pans in the image. Instead, it recovers pose by direct nonlinear regression against shape descriptor vectors extracted automatically from image silhouettes. For robustness against local silhouette segmentation errors, silhouette shape is encoded by histogram-of-shape-contexts descriptors. For the main regression, we evaluate both regularized least squares and relevance vector machine (RVM) regressors over both linear and kernel bases. The RVM´s provide much sparser regressors without compromising performance, and kernel bases give a small but worthwhile improvement in performance. For realism and good generalization with respect to viewpoints, we train the regressors on images resynthesized from real human motion capture data, and test it both quantitatively on similar independent test data, and qualitatively on a real image sequence. Mean angular errors of 6-7 degrees are obtained - a factor of 3 better than the current state of the art for the much simpler upper body problem.
Keywords
errors; image segmentation; image sequences; learning (artificial intelligence); least squares approximations; motion estimation; multidimensional signal processing; regression analysis; 3D human pose; direct nonlinear regression; histogram-of-shape-contexts descriptors; human motion capture data; image silhouettes; learning based method; mean angular errors; monocular image sequences; regularized least squares; relevance vector machine; relevance vector regression; shape descriptor vectors; silhouette segmentation errors; Biological system modeling; Humans; Image sequences; Kernel; Labeling; Learning systems; Robustness; Shape; Testing; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on
ISSN
1063-6919
Print_ISBN
0-7695-2158-4
Type
conf
DOI
10.1109/CVPR.2004.1315258
Filename
1315258
Link To Document