DocumentCode
3429977
Title
Speech Activity Detection with Lip Movement Image Signals
Author
Lee, Soo-jong ; Park, Jun ; Kim, Eung-Kyeu
Author_Institution
ETRI, Daejeon
fYear
2007
fDate
22-24 Aug. 2007
Firstpage
403
Lastpage
406
Abstract
This paper describes an attempt to correlate visual lip movement information acquired via a camera with speech audio acquired via a microphone from a human speaker, in order to prevent audio created by external noise from being misrecognized as speech emitted by that speaker. Images of the speaker's face are acquired via a PC camera and are then separated into images that indicate lip movement and images that do not. The lip movement image signal data is saved in shared memory and shared with the speech recognition process. This data is analyzed by the speech activity detection process, which is a pre-processing step of sound recognition. We combined a speech recognition processor and an image recognizer, and the interworking function operated successfully at a rate of 99.3%.
Keywords
computer vision; image motion analysis; image recognition; object recognition; speech recognition; PC camera; face images; human speaker; image recognition; lip movement image signals; lip movement visual information; microphone; sound recognition; speech activity detection; speech audio information; speech recognition; Acoustic noise; Cameras; Data analysis; Face; Humans; Microphones; Signal processing; Speech analysis; Speech enhancement; Speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications, Computers and Signal Processing, 2007. PacRim 2007. IEEE Pacific Rim Conference on
Conference_Location
Victoria, BC
Print_ISBN
978-1-4244-1189-4
Electronic_ISBN
1-4244-1190-4
Type
conf
DOI
10.1109/PACRIM.2007.4313259
Filename
4313259