DocumentCode :
2507538
Title :
Lipreading: A Graph Embedding Approach
Author :
Zhou, Ziheng ; Zhao, Guoying ; Pietikäinen, Matti
Author_Institution :
Dept. of Electr. & Inf. Eng., Univ. of Oulu, Oulu, Finland
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
523
Lastpage :
526
Abstract :
In this paper, we propose a novel graph embedding method for the problem of lipreading. To characterize the temporal connections among video frames of the same utterance, a new distance metric is defined on a pair of frames and graphs are constructed to represent the video dynamics based on the distances between frames. Audio information is used to assist in calculating such distances. For each utterance, a subspace of the visual feature space is learned from a well-defined intrinsic and penalty graph within a graph-embedding framework. Video dynamics are found to be well preserved along some dimensions of the subspace. Discriminatory cues are then decoded from curves of the projected visual features to classify different utterances.
Keywords :
graph theory; image sequences; speech processing; video signal processing; distance metric; graph embedding method; lipreading problem; penalty graph; speech perception; video dynamics; video frames; video sequences; Feature extraction; Indexes; Measurement; Training; Video sequences; Visualization; graph embedding; lipreading;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.133
Filename :
5597428