DocumentCode :
2958688
Title :
Latent structured models for human pose estimation
Author :
Ionescu, Catalin ; Li, Fuxin ; Sminchisescu, Cristian
Author_Institution :
Fac. of Math. & Natural Sci., Univ. of Bonn, Bonn, Germany
fYear :
2011
fDate :
6-13 Nov. 2011
Firstpage :
2220
Lastpage :
2227
Abstract :
We present an approach for automatic 3D human pose reconstruction from monocular images, based on a discriminative formulation with latent segmentation inputs. We advanced the field of structured prediction and human pose reconstruction on several fronts. First, by working with a pool of figure-ground segment hypotheses, the prediction problem is formulated in terms of combined learning and inference over segment hypotheses and 3D human articular configurations. Beside constructing tractable formulations for the combined segment selection and pose estimation problem, we propose new augmented kernels that can better encode complex dependencies between output variables. Furthermore, we provide primal linear re-formulations based on Fourier kernel approximations, in order to scale-up the non-linear latent structured prediction methodology. The proposed models are shown to be competitive in the HumanEva benchmark and are also illustrated in a clip collected from a Hollywood movie, where the model can infer human poses from monocular images captured in complex environments.
Keywords :
approximation theory; image reconstruction; image segmentation; inference mechanisms; learning (artificial intelligence); pose estimation; prediction theory; 3D human articular configuration; Fourier kernel approximation; Hollywood movie; HumanEva benchmark; augmented kernel; automatic 3D human pose reconstruction; combined segment selection; complex dependency encoding; discriminative formulation; figure-ground segment hypothesis; human pose estimation; inference; latent segmentation inputs; latent structured model; learning; monocular images; nonlinear latent structured prediction methodology; prediction problem; primal linear reformulation; tractable formulation; Estimation; Humans; Image segmentation; Joints; Kernel; Three dimensional displays; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2011 IEEE International Conference on
Conference_Location :
Barcelona
ISSN :
1550-5499
Print_ISBN :
978-1-4577-1101-5
Type :
conf
DOI :
10.1109/ICCV.2011.6126500
Filename :
6126500
Link To Document :
بازگشت