DocumentCode
1999803
Title
Model-based versus knowledge-guided representation of non-rigid objects: a case study
Author
Kober, R. ; Schiffers, J. ; Schmidt, K.
Author_Institution
Res. Inst for Appl. Knowledge Process., Ulm, Germany
Volume
1
fYear
1994
fDate
13-16 Nov 1994
Firstpage
973
Abstract
Two different approaches to the detection and representation of the mouth in real video images of the human face are investigated. The model-based approach presented is based on a technique known as “deformable templates”, and tries to approximate the contours of the lips with a model consisting of four parabolas. An alternative to the model-based approach, referred to as the knowledge-guided approach, is proposed. The basic idea is not to try to capture all of the a priori knowledge about an object in a single global model that is adapted to the image, but rather to utilize the a priori knowledge in a step-by-step way, in order to refine rough initial hypotheses into a compact description of the object. This method may be interpreted as a gradual concentration on the relevant structures in the image. The combination of the resulting structures yields a compact description of the object. In the application, which is the basis for this investigation, the goal is to enhance speech recognition by using visual information about lip movements in addition to the acoustic signal. Only the problem of finding an accurate and robust representation of the lips in an image is addressed. Each of the methods were investigated for the same set of 15 faces. Our experiments indicate that the knowledge-guided approach performs more accurately and more robustly, than the model-based approach
Keywords
face recognition; image representation; knowledge based systems; knowledge representation; model-based reasoning; object detection; speech recognition; video signal processing; acoustic signal; deformable templates; experiments; human face; image representation; image structures; knowledge-guided representation; lip movements; lips contour approximation; model-based representation; mouth detection; mouth representation; non-rigid objects representation; parabolas; real video images; speech recognition; visual information; Computer aided software engineering; Eyes; Face detection; Humans; Lips; Mouth; Object detection; Robustness; Solid modeling; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing, 1994. Proceedings. ICIP-94., IEEE International Conference
Conference_Location
Austin, TX
Print_ISBN
0-8186-6952-7
Type
conf
DOI
10.1109/ICIP.1994.413254
Filename
413254
Link To Document