Robust recognition of faces and facial features with a multi-modal system

Author

Graf, Hans Peter ; Cosatto, Eric ; Potamianos, Makis

Author_Institution

AT&T Labs., Red Bank, NJ, USA

Volume

3

fYear

1997

fDate

12-15 Oct 1997

Firstpage

2034

Abstract

We use a combination of shape and texture analysis, color segmentation and motion information for finding the positions of whole faces plus the precise location and shape of the mouth. Combining several modalities improves the robustness of the analysis considerably and allows handling of a wide variety of conditions. Mouth shapes can enhance the accuracy of speech recognition systems, in particular under noisy conditions. However, finding the shape of the mouth precisely under unrestricted conditions is a challenging task. To be of practical value a system must handle different complexions of people as well as variations in lighting, different head orientations, moustaches, beards, and glasses. To deal with such a diversity of conditions, our system includes several different models of the face and the mouth area. New faces are compared to these models and the most representative one is chosen for the analysis. We tested our system on samples from video sequences of 50 different speakers. When trained on a particular person, the mouth location is found correctly in more than 98% of the images. When trained on a random set of 10 people from the database, the system handles typically 87% of the other people correctly. In speaker-dependent lip reading experiments we observed 93% word accuracy on five-digit strings

Keywords

face recognition; image colour analysis; image sequences; image texture; motion estimation; speech recognition; video signal processing; color segmentation; facial features; five-digit strings; motion information; mouth; multi-modal system; robust recognition; shape analysis; speaker-dependent lip reading; speech recognition systems; texture analysis; unrestricted conditions; video sequences; word accuracy; Face recognition; Facial features; Image color analysis; Information analysis; Motion analysis; Mouth; Noise shaping; Robustness; Shape; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on

Conference_Location

Orlando, FL

ISSN

1062-922X

Print_ISBN

0-7803-4053-1

Type

conf

DOI

10.1109/ICSMC.1997.635160

Filename

635160