Title :
Parallel hidden Markov models for American sign language recognition
Author :
Vogler, Christian ; Metaxas, Dimitris
Author_Institution :
Dept. of Comput. & Inf. Sci., Pennsylvania Univ., Philadelphia, PA, USA
Abstract :
The major challenge that faces American Sign Language (ASL) recognition now is to develop methods that will scale well with increasing vocabulary size. Unlike in spoken languages, phonemes can occur simultaneously in ASL. The number of possible combinations of phonemes after enforcing linguistic constraints is approximately 5.5×108. Gesture recognition, which is less constrained than ASL recognition, suffers from the same problem. Thus, it is not feasible to train conventional hidden Markov models (HMMs) for large-scab ASL applications. Factorial HMMs and coupled HMMs are two extensions to HMMs that explicitly attempt to model several processes occuring in parallel. Unfortunately, they still require consideration of the combinations at training time. In this paper we present a novel approach to ASL recognition that aspires to being a solution to the scalability problems. It is based on parallel HMMs (PaHMMs), which model the parallel processes independently. Thus, they can also be trained independently, and do not require consideration of the different combinations at training time. We develop the recognition algorithm for PaHMMs and show that it runs in time polynomial in the number of states, and in time linear in the number of parallel processes. We run several experiments with a 22 sign vocabulary and demonstrate that PaHMMs can improve the robustness of HMM-based recognition even on a small scale. Thus, PaHMMs are a very promising general recognition scheme with applications in both gesture and ASL recognition
Keywords :
gesture recognition; hidden Markov models; natural language interfaces; ASL recognition; American sign language recognition; gesture recognition; hidden Markov models; linguistic constraints; scalability; Application software; Ear; Electronic switching systems; Face recognition; Handicapped aids; Hidden Markov models; Humans; Information science; Read only memory; Speech recognition;
Conference_Titel :
Computer Vision, 1999. The Proceedings of the Seventh IEEE International Conference on
Conference_Location :
Kerkyra
Print_ISBN :
0-7695-0164-8
DOI :
10.1109/ICCV.1999.791206