• DocumentCode
    639566
  • Title

    Pixel-Level Hand Detection in Ego-centric Videos

  • Author

    Cheng Li ; Kitani, Kris M.

  • Author_Institution
    Tsinghua Univ., Beijing, China
  • fYear
    2013
  • fDate
    23-28 June 2013
  • Firstpage
    3570
  • Lastpage
    3577
  • Abstract
    We address the task of pixel-level hand detection in the context of ego-centric cameras. Extracting hand regions in ego-centric videos is a critical step for understanding hand-object manipulation and analyzing hand-eye coordination. However, in contrast to traditional applications of hand detection, such as gesture interfaces or sign-language recognition, ego-centric videos present new challenges such as rapid changes in illuminations, significant camera motion and complex hand-object manipulations. To quantify the challenges and performance in this new domain, we present a fully labeled indoor/outdoor ego-centric hand detection benchmark dataset containing over 200 million labeled pixels, which contains hand images taken under various illumination conditions. Using both our dataset and a publicly available ego-centric indoors dataset, we give extensive analysis of detection performance using a wide range of local appearance features. Our analysis highlights the effectiveness of sparse features and the importance of modeling global illumination. We propose a modeling strategy based on our findings and show that our model outperforms several baseline approaches.
  • Keywords
    feature extraction; image motion analysis; image resolution; image sensors; lighting; object detection; sign language recognition; video signal processing; camera motion; complex hand-object manipulations; ego-centric cameras; ego-centric videos; gesture interfaces; hand region extraction; hand-eye coordination analysis; hand-object manipulation understanding; illumination conditions; indoor ego-centric hand detection benchmark dataset; local appearance features; outdoor ego-centric hand detection benchmark dataset; pixel-level hand detection; sign-language recognition; Cameras; Feature extraction; Image color analysis; Lighting; Skin; Videos; Visualization; First-person Vision; hand detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
  • Conference_Location
    Portland, OR
  • ISSN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2013.458
  • Filename
    6619302