DocumentCode
314523
Title
Visual speech recognition by recurrent neural networks
Author
Rabi, Gihad ; Lu, Siwei
Author_Institution
Dept. of Comput. Sci., Memorial Univ. of Newfoundland, St. John´´s, Nfld., Canada
Volume
1
fYear
1997
fDate
25-28 May 1997
Firstpage
55
Abstract
A system for visual speech recognition is described in this paper. In the first phase of the system´s operation, time-varying visual speech patterns are obtained from a sequence of images. In the second phase, the system uses recurrent neural networks to classify the spatio-temporal pattern as one of the previously-trained words. By specifying a certain behavior when a recurrent network is presented with exemplar sequences, the network is trained with no more than feed-forward complexity. The network´s desired behavior is based on characterizing a given word by well-defined segments. Adaptive segmentation is employed to segment the training sequences of a given word
Keywords
feature extraction; image recognition; image segmentation; image sequences; recurrent neural nets; speech recognition; adaptive segmentation; feed-forward complexity; image sequence; recurrent neural networks; spatio-temporal pattern; time-varying visual speech patterns; training sequences; visual speech recognition; Automatic speech recognition; Computer science; Crosstalk; Feedforward systems; Image segmentation; Mouth; Recurrent neural networks; Shape; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 1997. Engineering Innovation: Voyage of Discovery. IEEE 1997 Canadian Conference on
Conference_Location
St. Johns, Nfld.
ISSN
0840-7789
Print_ISBN
0-7803-3716-6
Type
conf
DOI
10.1109/CCECE.1997.614788
Filename
614788
Link To Document