DocumentCode :
827808
Title :
A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation
Author :
Alon, Jonathan ; Athitsos, Vassilis ; Yuan, Quan ; Sclaroff, Stan
Author_Institution :
Comput. Sci. Dept., Boston Univ., Boston, MA, USA
Volume :
31
Issue :
9
fYear :
2009
Firstpage :
1685
Lastpage :
1699
Abstract :
Within the context of hand gesture recognition, spatiotemporal gesture segmentation is the task of determining, in a video sequence, where the gesturing hand is located and when the gesture starts and ends. Existing gesture recognition methods typically assume either known spatial segmentation or known temporal segmentation, or both. This paper introduces a unified framework for simultaneously performing spatial segmentation, temporal segmentation, and recognition. In the proposed framework, information flows both bottom-up and top-down. A gesture can be recognized even when the hand location is highly ambiguous and when information about when the gesture begins and ends is unavailable. Thus, the method can be applied to continuous image streams where gestures are performed in front of moving, cluttered backgrounds. The proposed method consists of three novel contributions: a spatiotemporal matching algorithm that can accommodate multiple candidate hand detections in every frame, a classifier-based pruning framework that enables accurate and early rejection of poor matches to gesture models, and a subgesture reasoning algorithm that learns which gesture models can falsely match parts of other longer gestures. The performance of the approach is evaluated on two challenging applications: recognition of hand-signed digits gestured by users wearing short-sleeved shirts, in front of a cluttered background, and retrieval of occurrences of signs of interest in a video database containing continuous, unsegmented signing in American sign language (ASL).
Keywords :
gesture recognition; image classification; image matching; image segmentation; image sequences; inference mechanisms; learning (artificial intelligence); object detection; video signal processing; American sign language; classifier-based pruning framework; continuous image stream; gesture model learning; gesture recognition; hand detection; spatiotemporal gesture segmentation; spatiotemporal matching algorithm; subgesture reasoning algorithm; video sequence; Computer vision; Gesture recognition; Human-centered computing; continuous dynamic programming.; dynamic time warping; gesture spotting; human motion analysis; Algorithms; Artificial Intelligence; Gestures; Hand; Humans; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Sign Language;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.203
Filename :
4589214
Link To Document :
بازگشت