مرکز منطقه ای اطلاع رساني علوم و فناوري - Latent structured models for human pose estimation

DocumentCode :

2958688

Title :

Latent structured models for human pose estimation

Author :

Ionescu, Catalin ; Li, Fuxin ; Sminchisescu, Cristian

Author_Institution :

Fac. of Math. & Natural Sci., Univ. of Bonn, Bonn, Germany

fYear :

2011

fDate :

6-13 Nov. 2011

Firstpage :

2220

Lastpage :

2227

Abstract :

We present an approach for automatic 3D human pose reconstruction from monocular images, based on a discriminative formulation with latent segmentation inputs. We advanced the field of structured prediction and human pose reconstruction on several fronts. First, by working with a pool of figure-ground segment hypotheses, the prediction problem is formulated in terms of combined learning and inference over segment hypotheses and 3D human articular configurations. Beside constructing tractable formulations for the combined segment selection and pose estimation problem, we propose new augmented kernels that can better encode complex dependencies between output variables. Furthermore, we provide primal linear re-formulations based on Fourier kernel approximations, in order to scale-up the non-linear latent structured prediction methodology. The proposed models are shown to be competitive in the HumanEva benchmark and are also illustrated in a clip collected from a Hollywood movie, where the model can infer human poses from monocular images captured in complex environments.

Keywords :

approximation theory; image reconstruction; image segmentation; inference mechanisms; learning (artificial intelligence); pose estimation; prediction theory; 3D human articular configuration; Fourier kernel approximation; Hollywood movie; HumanEva benchmark; augmented kernel; automatic 3D human pose reconstruction; combined segment selection; complex dependency encoding; discriminative formulation; figure-ground segment hypothesis; human pose estimation; inference; latent segmentation inputs; latent structured model; learning; monocular images; nonlinear latent structured prediction methodology; prediction problem; primal linear reformulation; tractable formulation; Estimation; Humans; Image segmentation; Joints; Kernel; Three dimensional displays; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision (ICCV), 2011 IEEE International Conference on

Conference_Location :

Barcelona

ISSN :

1550-5499

Print_ISBN :

978-1-4577-1101-5

Type :

conf

DOI :

10.1109/ICCV.2011.6126500

Filename :

6126500

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2958688