DocumentCode
2954394
Title
Efficient regression of general-activity human poses from depth images
Author
Girshick, Ross ; Shotton, Jamie ; Kohli, Pushmeet ; Criminisi, Antonio ; Fitzgibbon, Andrew
Author_Institution
Microsoft Res. Cambridge, Cambridge, UK
fYear
2011
fDate
6-13 Nov. 2011
Firstpage
415
Lastpage
422
Abstract
We present a new approach to general-activity human pose estimation from depth images, building on Hough forests. We extend existing techniques in several ways: real time prediction of multiple 3D joints, explicit learning of voting weights, vote compression to allow larger training sets, and a comparison of several decision-tree training objectives. Key aspects of our work include: regression directly from the raw depth image, without the use of an arbitrary intermediate representation; applicability to general motions (not constrained to particular activities) and the ability to localize occluded as well as visible body joints. Experimental results demonstrate that our method produces state of the art results on several data sets including the challenging MSRC-5000 pose estimation test set, at a speed of about 200 frames per second. Results on silhouettes suggest broader applicability to other imaging modalities.
Keywords
decision trees; image representation; learning (artificial intelligence); pose estimation; regression analysis; Hough forest; MSRC-5000 pose estimation test set; arbitrary intermediate representation; decision-tree training objective; depth image; explicit learning; general-activity human pose regression; imaging modality; multiple 3D joint; visible body joint; voting weight; Accuracy; Estimation; Joints; Regression tree analysis; Three dimensional displays; Training; Vegetation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision (ICCV), 2011 IEEE International Conference on
Conference_Location
Barcelona
ISSN
1550-5499
Print_ISBN
978-1-4577-1101-5
Type
conf
DOI
10.1109/ICCV.2011.6126270
Filename
6126270
Link To Document