Title :
Visual speech recognition using active shape models and hidden Markov models
Author :
Luettin, Juergen ; Thacker, Neil A. ; Beet, Steve W.
Author_Institution :
Dept. of Electron. & Electr. Eng., Sheffield Univ., UK
Abstract :
This paper describes a novel approach for visual speech recognition. The shape of the mouth is modelled by an active shape model which is derived from the statistics of a training set and used to locate, track and parameterise the speaker´s lip movements. The extracted parameters representing the lip shape are modelled as continuous probability distributions and their temporal dependencies are modelled by hidden Markov models. We present recognition tests performed on a database of a broad variety of speakers and illumination conditions. The system achieved an accuracy of 85.42% for a speaker independent recognition task of the first four digits using lip shape information only
Keywords :
feature extraction; hidden Markov models; image processing; parameter estimation; probability; speech recognition; HMM; active shape models; continuous probability distributions; database; extracted parameters; hidden Markov models; illumination conditions; lip movements; lip shape information; mouth shape; recognition accuracy; recognition tests; speaker independent recognition; temporal dependencies; training set statistics; visual speech recognition; Active shape model; Databases; Hidden Markov models; Mouth; Performance evaluation; Probability distribution; Speech recognition; Statistical distributions; Testing; Tracking;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.543246