Title :
Speaker identification by lipreading
Author :
Luettin, Juergen ; Thacker, Neil A. ; Beet, S.W.
Author_Institution :
Dept. of Electron. & Electr. Eng., Sheffield Univ., UK
Abstract :
This paper describes a new approach for speaker identification based on lipreading. Visual features are extracted from image sequences of the talking face and consist of shape parameters which describe the lip boundary and intensity parameters which describe the grey-level distribution of the mouth area. Intensity information is based on principal component analysis using eigenspaces which deform with the shape model. The extracted parameters account for both, speech dependent and speaker dependent information. We built spatio-temporal speaker models based on these features, using HMMs with mixtures of Gaussians. Promising results were obtained for text dependent and text independent speaker identification tests performed on a small video database
Keywords :
Gaussian processes; edge detection; eigenvalues and eigenfunctions; feature extraction; hidden Markov models; image sequences; speaker recognition; statistical analysis; visual databases; Gaussian mixture; HMM; eigenspaces; grey-level distribution; hidden Markov model; image sequences; intensity parameters; lip boundary; lipreading; mouth area; principal component analysis; shape model; shape parameters; spatio-temporal speaker models; speaker dependent information; speaker identification; speech dependent information; talking face; video database; visual feature extraction; Data mining; Deformable models; Feature extraction; Gaussian processes; Hidden Markov models; Image sequences; Mouth; Principal component analysis; Shape; Speech;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607030