DocumentCode :
1884232
Title :
Continuous optical automatic speech recognition by lipreading
Author :
Goldschen, Alan J. ; Garcia, Oscar N. ; Petajan, Eric
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., George Washington Univ., Washington, DC, USA
Volume :
1
fYear :
1994
fDate :
31 Oct-2 Nov 1994
Firstpage :
572
Abstract :
We describe a continuous optical automatic speech recognizer (OASR) that uses optical information from the oral-cavity shadow of a speaker. The system achieves a 25.3 percent recognition on sentences having a perplexity of 150 without using any syntactic, semantic, acoustic, or contextual guides. We introduce 13, mostly dynamic, oral-cavity features used for optical recognition, present phones that appear optically similar (visemes) for our speaker, and present the recognition results for our hidden Markov models (HMMs) using visemes, trisemes, and generalized trisemes. We conclude that future research is warranted for optical recognition, especially when combined with other input modalities
Keywords :
hidden Markov models; optical information processing; speech coding; speech recognition; HMM; automatic speech recognition; codebook; continuous optical automatic speech recognizer; dynamic oral-cavity features; generalized trisemes; hidden Markov models; input modalities; lipreading; optical information; oral-cavity shadow; perplexity; phones; recognition results; research; sentences; trisemes; visemes; Automatic speech recognition; Cameras; Hidden Markov models; Image databases; Loudspeakers; Optical devices; Optical filters; Optical noise; Optical recording; Spatial databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 1994. 1994 Conference Record of the Twenty-Eighth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
0-8186-6405-3
Type :
conf
DOI :
10.1109/ACSSC.1994.471517
Filename :
471517
Link To Document :
بازگشت