DocumentCode :
2215574
Title :
Robust recognition of faces and facial features with a multi-modal system
Author :
Graf, Hans Peter ; Cosatto, Eric ; Potamianos, Makis
Author_Institution :
AT&T Labs., Red Bank, NJ, USA
Volume :
3
fYear :
1997
fDate :
12-15 Oct 1997
Firstpage :
2034
Abstract :
We use a combination of shape and texture analysis, color segmentation and motion information for finding the positions of whole faces plus the precise location and shape of the mouth. Combining several modalities improves the robustness of the analysis considerably and allows handling of a wide variety of conditions. Mouth shapes can enhance the accuracy of speech recognition systems, in particular under noisy conditions. However, finding the shape of the mouth precisely under unrestricted conditions is a challenging task. To be of practical value a system must handle different complexions of people as well as variations in lighting, different head orientations, moustaches, beards, and glasses. To deal with such a diversity of conditions, our system includes several different models of the face and the mouth area. New faces are compared to these models and the most representative one is chosen for the analysis. We tested our system on samples from video sequences of 50 different speakers. When trained on a particular person, the mouth location is found correctly in more than 98% of the images. When trained on a random set of 10 people from the database, the system handles typically 87% of the other people correctly. In speaker-dependent lip reading experiments we observed 93% word accuracy on five-digit strings
Keywords :
face recognition; image colour analysis; image sequences; image texture; motion estimation; speech recognition; video signal processing; color segmentation; facial features; five-digit strings; motion information; mouth; multi-modal system; robust recognition; shape analysis; speaker-dependent lip reading; speech recognition systems; texture analysis; unrestricted conditions; video sequences; word accuracy; Face recognition; Facial features; Image color analysis; Information analysis; Motion analysis; Mouth; Noise shaping; Robustness; Shape; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
Conference_Location :
Orlando, FL
ISSN :
1062-922X
Print_ISBN :
0-7803-4053-1
Type :
conf
DOI :
10.1109/ICSMC.1997.635160
Filename :
635160
Link To Document :
بازگشت