• DocumentCode
    1765200
  • Title

    Parsing the Hand in Depth Images

  • Author

    Hui Liang ; Junsong Yuan ; Thalmann, Daniel

  • Author_Institution
    There Centre, Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    16
  • Issue
    5
  • fYear
    2014
  • fDate
    Aug. 2014
  • Firstpage
    1241
  • Lastpage
    1253
  • Abstract
    Hand pose tracking and gesture recognition are useful for human-computer interaction, while a major problem is the lack of discriminative features for compact hand representation. We present a robust hand parsing scheme to extract a high-level description of the hand from the depth image. A novel distance-adaptive selection method is proposed to get more discriminative depth-context features. Besides, we propose a Superpixel-Markov Random Field (SMRF) parsing scheme to enforce the spatial smoothness and the label co-occurrence prior to remove the misclassified regions. Compared to pixel-level filtering, the SMRF scheme is more suitable to model the misclassified regions. By fusing the temporal constraints, its performance can be further improved. Overall, the proposed hand parsing scheme is accurate and efficient. The tests on synthesized dataset show it gives much higher accuracy for single-frame parsing and enhanced robustness for continuous sequence parsing compared to benchmarks. The tests on real-world depth images of the hand and human body show the robustness to complex hand configurations of our method and its generalization power to different kinds of articulated objects.
  • Keywords
    Markov processes; computer vision; feature extraction; gesture recognition; grammars; human computer interaction; image classification; image representation; image sequences; object tracking; pose estimation; SMRF parsing scheme; articulated objects; compact hand representation; complex hand configurations; continuous sequence parsing; depth images; discriminative depth-context features; discriminative features; distance-adaptive selection method; generalization power; gesture recognition; hand high-level description extraction; hand pose tracking; human body; human-computer interaction; label cooccurrence; misclassified region removal; pixel-level filtering; robust hand parsing scheme; robustness; single-frame parsing; spatial smoothness; superpixel-Markov random field parsing scheme; temporal constraints; Cameras; Gesture recognition; Image color analysis; Markov random fields; Resource description framework; Robustness; Shape; Depth-context feature; Markov random field; hand parsing;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2014.2306177
  • Filename
    6740010