DocumentCode :
1765200
Title :
Parsing the Hand in Depth Images
Author :
Hui Liang ; Junsong Yuan ; Thalmann, Daniel
Author_Institution :
There Centre, Nanyang Technol. Univ., Singapore, Singapore
Volume :
16
Issue :
5
fYear :
2014
fDate :
Aug. 2014
Firstpage :
1241
Lastpage :
1253
Abstract :
Hand pose tracking and gesture recognition are useful for human-computer interaction, while a major problem is the lack of discriminative features for compact hand representation. We present a robust hand parsing scheme to extract a high-level description of the hand from the depth image. A novel distance-adaptive selection method is proposed to get more discriminative depth-context features. Besides, we propose a Superpixel-Markov Random Field (SMRF) parsing scheme to enforce the spatial smoothness and the label co-occurrence prior to remove the misclassified regions. Compared to pixel-level filtering, the SMRF scheme is more suitable to model the misclassified regions. By fusing the temporal constraints, its performance can be further improved. Overall, the proposed hand parsing scheme is accurate and efficient. The tests on synthesized dataset show it gives much higher accuracy for single-frame parsing and enhanced robustness for continuous sequence parsing compared to benchmarks. The tests on real-world depth images of the hand and human body show the robustness to complex hand configurations of our method and its generalization power to different kinds of articulated objects.
Keywords :
Markov processes; computer vision; feature extraction; gesture recognition; grammars; human computer interaction; image classification; image representation; image sequences; object tracking; pose estimation; SMRF parsing scheme; articulated objects; compact hand representation; complex hand configurations; continuous sequence parsing; depth images; discriminative depth-context features; discriminative features; distance-adaptive selection method; generalization power; gesture recognition; hand high-level description extraction; hand pose tracking; human body; human-computer interaction; label cooccurrence; misclassified region removal; pixel-level filtering; robust hand parsing scheme; robustness; single-frame parsing; spatial smoothness; superpixel-Markov random field parsing scheme; temporal constraints; Cameras; Gesture recognition; Image color analysis; Markov random fields; Resource description framework; Robustness; Shape; Depth-context feature; Markov random field; hand parsing;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2014.2306177
Filename :
6740010
Link To Document :
بازگشت