Title :
A novel lip localization method based on shiftable wavelets transform
Author :
Yanjun, XU ; Limin, Du ; Ziqiang, Hou
Author_Institution :
Inst. of Acoust., Acad. Sinica, Beijing, China
Abstract :
Visual feature extraction is one of the most important techniques in audiovisual bimodal speech recognition, and also remains a very challenging area in image understanding. A shiftable multiscale transform is introduced into the construction of an active shape model. It uses the pyramidal data to describe the structure of an image, which is invariant to illumination and perspective variability and thus contributes a lot to the improvement of the robustness of the model. A segmental downhill simplex method is also put forward to improve the minimization procedure of lip localization. It employs a kind of “coarse-to-fine” strategy to speed up the convergence and improve the robustness of lip localization. Experiments support the validity of the new method, and show better robustness and higher efficiency
Keywords :
computational geometry; convergence of numerical methods; feature extraction; image resolution; minimisation; speech recognition; wavelet transforms; active shape model construction; audiovisual bimodal speech recognition; convergence; image structure; image understanding; lip localization; minimization; pyramidal data; segmental downhill simplex method; shiftable multiscale transform; shiftable wavelet transform; visual feature extraction; Acoustic waves; Active shape model; Covariance matrix; Deformable models; Eigenvalues and eigenfunctions; Feature extraction; Lighting; Noise robustness; Speech; Wavelet transforms;
Conference_Titel :
Signal Processing Proceedings, 1998. ICSP '98. 1998 Fourth International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-4325-5
DOI :
10.1109/ICOSP.1998.770790