DocumentCode
2217814
Title
Audiovisual speech enhancement experiments for mouth segmentation evaluation
Author
Gacon, Pierre ; Coulon, Pierre-Yves ; Bailly, Gerard
Author_Institution
LIS, INPG, Grenoble, France
fYear
2006
fDate
4-8 Sept. 2006
Firstpage
1
Lastpage
5
Abstract
Mouth segmentation is an important issue which applies in many multimedia applications as speech reading, face synthesis, recognition or audiovisual communication. Our goal is to have a robust and efficient detection of lips contour in order to restore as faithfully as possible the speech movement. We present a methodology which focused on the detection of the inner and outer mouth contours which is a difficult task due to the non-linear appearance variations. Our method is based on a statistical model of shape with local appearance gaussian descriptors whose theoretical responses were predicted by a non-linear neural network. From our automatic segmentation of the mouth, we can generate a clone of a speaker mouth whose lips movements will be as close as possible of the original ones. In this paper, results obtained by this methodology are evaluated qualitatively by testing the relevance of this clone. We carried out an experience which quantified the effective enhancement in comprehension brought by our analysis-resynthesis scheme in a telephone enquiry task.
Keywords
Gaussian processes; face recognition; neural nets; speech enhancement; statistical analysis; analysis-resynthesis scheme; audiovisual communication; audiovisual speech enhancement; face recognition; face synthesis; gaussian descriptors; mouth segmentation evaluation; multimedia applications; nonlinear neural network; speech movement; speech reading; statistical model; telephone enquiry task; theoretical responses; Image segmentation; Lips; Mathematical model; Mouth; Noise; Shape; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2006 14th European
Conference_Location
Florence
ISSN
2219-5491
Type
conf
Filename
7071307
Link To Document