Title :
Multi-Frame Sparse Feature Extraction for Lip-Reading
Author :
Lee, M.J. ; Soo-Young Lee ; Lee, Michelle Jeungeun
Author_Institution :
Dept. of Biosyst., Korea Adv. Inst. of Sci. & Technol., Daejeon
Abstract :
The features of human lip motion from video clips are extracted by three unsupervised learning algorithms, i.e., principle component analysis, independent component analysis, and non-negative matrix factorization. Since the human perception of facial motion goes through two different pathways, i.e., the lateral fusifom gyrus for the invariant aspects and the superior temporal sulcus for the changeable aspects of faces, we extracted the dynamic video features from multiple consecutive frames for the latter. The multiple-frame features require less number of coefficients for the same frame length than the single-frame static features, and also result in better recognition performance
Keywords :
feature extraction; gesture recognition; image motion analysis; independent component analysis; matrix decomposition; principal component analysis; unsupervised learning; human lip motion; independent component analysis; lip-reading; multi-frame sparse feature extraction; nonnegative matrix factorization; principle component analysis; unsupervised learning; Algorithm design and analysis; Face; Feature extraction; Humans; Image analysis; Independent component analysis; Motion analysis; Principal component analysis; Speech synthesis; Unsupervised learning;
Conference_Titel :
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9422-4
DOI :
10.1109/ICNNB.2005.1615004