Title :
Location normalization of HMM-based lip-reading: experiments for the M2VTS database
Author :
Vanegas, O. ; Tokuda, Keiichi ; Kitamura, Takamitsu
Author_Institution :
Dept. of Comput. Sci., Nagoya Inst. of Technol., Japan
Abstract :
This paper describes an HMM-based lip location normalization process, in order to improve the recognition performance in automatic lip-reading. This paper uses the image-based method in order to represent the lip visual information. One of the most critical factors which affect the recognition results in image-based method is the position of lips in frames. This paper describes a method to normalize the lip location which is similar to SAT (speaker adaptive training), and presents several experiments which were carried out in order to measure the effectiveness of the proposed method. Experiments of isolated words with and without the original movement from speakers were carried out on the M2VTS database.
Keywords :
feature extraction; hidden Markov models; maximum likelihood estimation; speech recognition; M2VTS database; hidden Markov model based lip-reading; image-based method; isolated words; lip location normalization process; location normalization; speaker adaptive training; Computer science; Data mining; Hidden Markov models; Image databases; Image recognition; Lips; Robustness; Speech recognition; Tracking; Visual databases;
Conference_Titel :
Image Processing, 1999. ICIP 99. Proceedings. 1999 International Conference on
Conference_Location :
Kobe
Print_ISBN :
0-7803-5467-2
DOI :
10.1109/ICIP.1999.822914