DocumentCode
2215574
Title
Robust recognition of faces and facial features with a multi-modal system
Author
Graf, Hans Peter ; Cosatto, Eric ; Potamianos, Makis
Author_Institution
AT&T Labs., Red Bank, NJ, USA
Volume
3
fYear
1997
fDate
12-15 Oct 1997
Firstpage
2034
Abstract
We use a combination of shape and texture analysis, color segmentation and motion information for finding the positions of whole faces plus the precise location and shape of the mouth. Combining several modalities improves the robustness of the analysis considerably and allows handling of a wide variety of conditions. Mouth shapes can enhance the accuracy of speech recognition systems, in particular under noisy conditions. However, finding the shape of the mouth precisely under unrestricted conditions is a challenging task. To be of practical value a system must handle different complexions of people as well as variations in lighting, different head orientations, moustaches, beards, and glasses. To deal with such a diversity of conditions, our system includes several different models of the face and the mouth area. New faces are compared to these models and the most representative one is chosen for the analysis. We tested our system on samples from video sequences of 50 different speakers. When trained on a particular person, the mouth location is found correctly in more than 98% of the images. When trained on a random set of 10 people from the database, the system handles typically 87% of the other people correctly. In speaker-dependent lip reading experiments we observed 93% word accuracy on five-digit strings
Keywords
face recognition; image colour analysis; image sequences; image texture; motion estimation; speech recognition; video signal processing; color segmentation; facial features; five-digit strings; motion information; mouth; multi-modal system; robust recognition; shape analysis; speaker-dependent lip reading; speech recognition systems; texture analysis; unrestricted conditions; video sequences; word accuracy; Face recognition; Facial features; Image color analysis; Information analysis; Motion analysis; Mouth; Noise shaping; Robustness; Shape; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
Conference_Location
Orlando, FL
ISSN
1062-922X
Print_ISBN
0-7803-4053-1
Type
conf
DOI
10.1109/ICSMC.1997.635160
Filename
635160
Link To Document