DocumentCode :
534267
Title :
Research of Visual Features Detection and Tracking Methods about Audio-Visual Bimodal Speech Recognition
Author :
Lirong, Wang ; Jing, Xu ; Yanyan, Zhao
Author_Institution :
Dept. of Electron. & Inf., Univ. of ChangChun, ChangChun, China
Volume :
1
fYear :
2010
fDate :
16-18 July 2010
Firstpage :
204
Lastpage :
207
Abstract :
Audio-visual bimodal speech recognition can improve the speech recognition rate, and lip detection, location, and tracking are key to a bimodal speech recognition system. This article discusses lip detection, location, and tracking algorithms for bimodal speech recognition. Lips are located precisely using the geometric structure of the face, the relative position of the lips, and the separable color information of the color space. An adaptive color filter is used to segment the lip contour effectively, and the PMM algorithm is used to locate and track the lips precisely. Experimental results show that the algorithms studied in this paper can detect, locate, and track lips precisely, robustly, and quickly.
Keywords :
Markov processes; adaptive filters; audio-visual systems; face recognition; image colour analysis; image segmentation; optical filters; speech recognition; PMM algorithm; adaptive color filter; audio visual bimodal speech recognition; color space information; feature tracking method; geometric face structure; lip contour segmentation; lip detection; visual feature detection; Image color analysis; Image segmentation; Information filters; Lips; Mouth; Skin; Speech recognition; audio-visual bimodal; lip detection; recognition; tracking;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Technology and Applications (IFITA), 2010 International Forum on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-7621-3
Electronic_ISBN :
978-1-4244-7622-0
Type :
conf
DOI :
10.1109/IFITA.2010.283
Filename :
5635120