مرکز منطقه ای اطلاع رساني علوم و فناوري - Pixel-Level Hand Detection in Ego-centric Videos

DocumentCode :

639566

Title :

Pixel-Level Hand Detection in Ego-centric Videos

Author :

Cheng Li ; Kitani, Kris M.

Author_Institution :

Tsinghua Univ., Beijing, China

fYear :

2013

fDate :

23-28 June 2013

Firstpage :

3570

Lastpage :

3577

Abstract :

We address the task of pixel-level hand detection in the context of ego-centric cameras. Extracting hand regions in ego-centric videos is a critical step for understanding hand-object manipulation and analyzing hand-eye coordination. However, in contrast to traditional applications of hand detection, such as gesture interfaces or sign-language recognition, ego-centric videos present new challenges such as rapid changes in illuminations, significant camera motion and complex hand-object manipulations. To quantify the challenges and performance in this new domain, we present a fully labeled indoor/outdoor ego-centric hand detection benchmark dataset containing over 200 million labeled pixels, which contains hand images taken under various illumination conditions. Using both our dataset and a publicly available ego-centric indoors dataset, we give extensive analysis of detection performance using a wide range of local appearance features. Our analysis highlights the effectiveness of sparse features and the importance of modeling global illumination. We propose a modeling strategy based on our findings and show that our model outperforms several baseline approaches.

Keywords :

feature extraction; image motion analysis; image resolution; image sensors; lighting; object detection; sign language recognition; video signal processing; camera motion; complex hand-object manipulations; ego-centric cameras; ego-centric videos; gesture interfaces; hand region extraction; hand-eye coordination analysis; hand-object manipulation understanding; illumination conditions; indoor ego-centric hand detection benchmark dataset; local appearance features; outdoor ego-centric hand detection benchmark dataset; pixel-level hand detection; sign-language recognition; Cameras; Feature extraction; Image color analysis; Lighting; Skin; Videos; Visualization; First-person Vision; hand detection;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location :

Portland, OR

ISSN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2013.458

Filename :

6619302

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=639566