DocumentCode :
3247208
Title :
Lip reading using optical flow and support vector machines
Author :
Shaikh, Ayaz A. ; Kumar, Dinesh K. ; Yau, Wai C. ; Azemin, M. Z Che ; Gubbi, Jayavardhana
Author_Institution :
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
Volume :
1
fYear :
2010
fDate :
16-18 Oct. 2010
Firstpage :
327
Lastpage :
330
Abstract :
This paper presents a lip reading technique to classify the discrete utterances without evaluating the acoustic signals. The reported technique analysis the video data of lip motions by computing the optical flow (OF). The statistical properties of the vertical OF component were used to form the feature vectors for training the support vector machines (SVM) classifier. The impact of the variation in speed/velocity of speaking on the performance of the system was minimized by removing the zero energy frames and normalizing the number of frames by interpolation. The resulting system is an efficient visual viseme classifier with high accuracy (95.9%), specificity (98.1%) and sensitivity (66.4%). The results of the experiments demonstrate the developed technique is insensitive to inter speaker variations.
Keywords :
feature extraction; support vector machines; discrete utterance; feature vector; lip motion; lip reading; optical flow; support vector machines; Computer vision; Image motion analysis; Optical imaging; Speech; Speech recognition; Support vector machines; Visualization; Lipreading; optical flow; support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image and Signal Processing (CISP), 2010 3rd International Congress on
Conference_Location :
Yantai
Print_ISBN :
978-1-4244-6513-2
Type :
conf
DOI :
10.1109/CISP.2010.5646264
Filename :
5646264
Link To Document :
بازگشت