DocumentCode
3018979
Title
Enhanced Level Building Algorithm for the Movement Epenthesis Problem in Sign Language Recognition
Author
Yang, Ruiduo ; Sarkar, Sudeep ; Loeding, Barbara
Author_Institution
Univ. of South Florida, Tampa
fYear
2007
fDate
17-22 June 2007
Firstpage
1
Lastpage
8
Abstract
One of the hard problems in automated sign language recognition is the movement epenthesis (me) problem. Movement epenthesis is the gesture movement that bridges two consecutive signs. This effect can be over a long duration and involve variations in hand shape, position, and movement, making it hard to explicitly model these intervening segments. This creates a problem when trying to match individual signs to full sign sentences since for many chunks of the sentence, corresponding to these mes, we do not have models. We present an approach based on version of a dynamic programming framework, called Level Building, to simultaneously segment and match signs to continuous sign language sentences in the presence of movement epenthesis (me). We enhance the classical Level Building framework so that it can accommodate me labels for which we do not have explicit models. This enhanced Level Building algorithm is then coupled with a trigram grammar model to optimally segment and label sign language sentences. We demonstrate the efficiency of the algorithm using a single view video dataset of continuous sign language sentences. We obtain 83% word level recognition rate with the enhanced Level Building approach, as opposed to a 20% recognition rate using a classical Level Building framework on the same dataset. The proposed approach is novel since it does not need explicit models for movement epenthesis.
Keywords
dynamic programming; gesture recognition; image matching; dynamic programming; enhanced level building algorithm; gesture movement; movement epenthesis problem; sign language recognition; sign matching; trigram grammar model; Bridges; Computer science; Computer science education; Dynamic programming; Handicapped aids; Hidden Markov models; Humans; Lakes; Shape; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
Conference_Location
Minneapolis, MN
ISSN
1063-6919
Print_ISBN
1-4244-1179-3
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2007.383347
Filename
4270345
Link To Document