• DocumentCode
    2215574
  • Title
    Robust recognition of faces and facial features with a multi-modal system
  • Author
    Graf, Hans Peter ; Cosatto, Eric ; Potamianos, Makis
  • Author_Institution
    AT&T Labs., Red Bank, NJ, USA
  • Volume
    3
  • fYear
    1997
  • fDate
    12-15 Oct 1997
  • Firstpage
    2034
  • Abstract
    We use a combination of shape and texture analysis, color segmentation and motion information for finding the positions of whole faces plus the precise location and shape of the mouth. Combining several modalities improves the robustness of the analysis considerably and allows handling of a wide variety of conditions. Mouth shapes can enhance the accuracy of speech recognition systems, in particular under noisy conditions. However, finding the shape of the mouth precisely under unrestricted conditions is a challenging task. To be of practical value a system must handle different complexions of people as well as variations in lighting, different head orientations, moustaches, beards, and glasses. To deal with such a diversity of conditions, our system includes several different models of the face and the mouth area. New faces are compared to these models and the most representative one is chosen for the analysis. We tested our system on samples from video sequences of 50 different speakers. When trained on a particular person, the mouth location is found correctly in more than 98% of the images. When trained on a random set of 10 people from the database, the system typically handles 87% of the other people correctly. In speaker-dependent lip reading experiments we observed 93% word accuracy on five-digit strings.
  • Keywords
    face recognition; image colour analysis; image sequences; image texture; motion estimation; speech recognition; video signal processing; color segmentation; facial features; five-digit strings; motion information; mouth; multi-modal system; robust recognition; shape analysis; speaker-dependent lip reading; speech recognition systems; texture analysis; unrestricted conditions; video sequences; word accuracy; Face recognition; Facial features; Image color analysis; Information analysis; Motion analysis; Mouth; Noise shaping; Robustness; Shape; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
  • Conference_Location
    Orlando, FL
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-4053-1
  • Type
    conf
  • DOI
    10.1109/ICSMC.1997.635160
  • Filename
    635160