Title :
Lip detection for audio-visual speech recognition in-car environment
Author :
Navarathna, Rajitha ; Lucey, Patrick ; Dean, David ; Fookes, Clinton ; Sridharan, Sridha
Author_Institution :
Speech, Audio, Image & Video Technol., Queensland Univ. of Technol., Brisbane, QLD, Australia
Abstract :
Acoustically, car cabins are extremely noisy and as a consequence audio-only, in-car voice recognition systems perform poorly. As the visual modality is immune to acoustic noise, using the visual lip information from the driver is seen as a viable strategy in circumventing this problem by using audio visual automatic speech recognition (AVASR). However, implementing AVASR requires a system being able to accurately locate and track the drivers face and lip area in real-time. In this paper we present such an approach using the Viola-Jones algorithm. Using the AVICAR [1] in-car database, we show that the Viola- Jones approach is a suitable method of locating and tracking the driver´s lips despite the visual variability of illumination and head pose for audio-visual speech recognition system.
Keywords :
speech recognition; traffic engineering computing; AVASR; acoustic noise; audio visual automatic speech recognition; car cabins; car environment; lip detection; voice recognition systems; Face; Facial features; Smoothing methods; AVASR; AVICAR database; Viola-Jones algorithm;
Conference_Titel :
Information Sciences Signal Processing and their Applications (ISSPA), 2010 10th International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-7165-2
DOI :
10.1109/ISSPA.2010.5605429