DocumentCode
2222987
Title
Visual speech recognition: a solution from feature extraction to words classification
Author
Da Silveira, Luciana Gonçalves ; Facon, Jacques ; Borges, Díbio Leandro
Author_Institution
Faculdade Cambury, Cambury Coll., Goiania, Brazil
fYear
2003
fDate
12-15 Oct. 2003
Firstpage
399
Lastpage
405
Abstract
Audio-visual speech recognition has been an active area of research lately. A big, and yet unsolved, part of this problem is visual-only recognition, or lip reading. Given an image sequence of a person pronouncing a word, a full image analysis solution would have to segment the mouth area, extract relevant features, and use those features to classify the word. We approach this problem by proposing a segmentation technique for the lip contours, together with a set of features based on the extracted contours, that is able to perform lip reading with promising results. We have collected visual speech sequences in our lab and show results for a set of ten words in Brazilian Portuguese, spoken by different speakers in more than 150 samples. The approach can be extended and applied to other spoken languages as well.
Keywords
edge detection; feature extraction; image segmentation; image sequences; natural languages; speech recognition; Brazilian Portuguese; extracted contours; feature extraction; image analysis solution; image segmentation technique; image sequence; lip reading; lips contours; visual speech recognition; visual speech sequences; words classification; Active shape model; Automatic speech recognition; Feature extraction; Image recognition; Image segmentation; Image sequences; Lips; Mouth; Speech recognition; Vocabulary;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Graphics and Image Processing, 2003. SIBGRAPI 2003. XVI Brazilian Symposium on
ISSN
1530-1834
Print_ISBN
0-7695-2032-4
Type
conf
DOI
10.1109/SIBGRA.2003.1241036
Filename
1241036