DocumentCode :
2036008
Title :
Lipreading by Locality Discriminant Graph
Author :
Fu, Yun ; Zhou, Xi ; Liu, Ming ; Hasegawa-Johnson, Mark ; Huang, Thomas S.
Author_Institution :
Illinois Univ. Urbana-Champaign, Urbana
Volume :
3
fYear :
2007
fDate :
Sept. 16 2007-Oct. 19 2007
Abstract :
The major problem in building a good lipreading system is to extract effective visual features from the enormous quantity of video sequences data. For appearance-based feature analysis in lipreading, classical methods, e.g. DCT, PCA and LDA, are usually applied to dimensionality reduction. We present a new pattern classification algorithm, called locality discriminant graph (LDG), and develop a novel lipreading framework to successfully apply LDG to the problem. LDG takes the advantages of both manifold learning and Fisher criteria to seek the linear embedding which preserves the local neighborhood affinity within same class while discriminating the neighborhood among different classes. The LDG embedding is computed in closed-form and tuned by the only open parameter of k-NN number. Experiments on AVICAR corpus provide evidence that the graph-based pattern classification methods can outperform classical ones for lipreading.
Keywords :
discrete cosine transforms; feature extraction; graph theory; image classification; image sequences; video signal processing; AVICAR corpus; appearance-based feature analysis; linear embedding; lipreading system; locality discriminant graph; pattern classification algorithm; video sequences data; Data mining; Discrete cosine transforms; Feature extraction; Hidden Markov models; Linear discriminant analysis; Mouth; Pattern classification; Principal component analysis; Speech analysis; Video sequences; Lipreading; audio-visual speech; discrete cosine transform; discriminant analysis; graph embedding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2007. ICIP 2007. IEEE International Conference on
Conference_Location :
San Antonio, TX
ISSN :
1522-4880
Print_ISBN :
978-1-4244-1437-6
Electronic_ISBN :
1522-4880
Type :
conf
DOI :
10.1109/ICIP.2007.4379312
Filename :
4379312
Link To Document :
بازگشت