DocumentCode :
2401807
Title :
Macro-cuboïd based probabilistic matching for lip-reading digits
Author :
Pachoud, Samuel ; Gong, Shaogang ; Cavallaro, Andrea
Author_Institution :
London Univ., London
fYear :
2008
fDate :
23-28 June 2008
Firstpage :
1
Lastpage :
8
Abstract :
In this paper, we present a spatio-temporal feature representation and a probabilistic matching function to recognise lip movements from pronounced digits. Our model (1) automatically selects spatio-temporal features extracted from 10 digit model templates and (2) matches them with probe video sequences. Spatio-temporal features embed lip movements from pronouncing digits and contain more discriminative information than spatial features alone. A model template for each digit is represented by a set of spatio-temporal features at multiple scales. A probabilistic sequence matching function automatically segments a probe video sequence and matches the most likely sequence of digits recognised in the probe sequence. We demonstrate the proposed approach using the CUAVE database and compare our representational scheme with three alternative methods, based on optical flow, intensity gradient and block matching, respectively. The evaluation shows that the proposed approach outperforms the others in recognition accuracy and is robust in coping with variations in probe sequences.
Keywords :
feature extraction; image matching; image motion analysis; image representation; image sequences; probability; video signal processing; CUAVE database; lip movements; lip-reading digits; macro-cuboid based probabilistic matching; optical flow; probabilistic sequence matching function; spatio-temporal feature representation; spatio-temporal features extraction; Data mining; Feature extraction; Image motion analysis; Mouth; Optical sensors; Probes; Robustness; Shape; Speech; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
Conference_Location :
Anchorage, AK
ISSN :
1063-6919
Print_ISBN :
978-1-4244-2242-5
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2008.4587734
Filename :
4587734
Link To Document :
بازگشت