DocumentCode :
1370757
Title :
Computer lipreading for improved accuracy in automatic speech recognition
Author :
Silsbee, Peter L. ; Bovik, Alan C.
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
Volume :
4
Issue :
5
fYear :
1996
fDate :
9/1/1996 12:00:00 AM
Firstpage :
337
Lastpage :
351
Abstract :
Among the various methods that have been proposed to improve the robustness and accuracy of automatic speech recognition (ASR) systems, lipreading has received little attention until very recently. However, results from the psychological literature indicate that lipreading, in conjunction with auditory perception, can provide a strong improvement in speech recognition and understanding in humans. We have developed a novel speaker-dependent lipreading system that uses hidden Markov models. An audiovisual system known as Lipreading to Enhance Automatic Perception of Speech (LEAPS) is described, in which the lipreading system is used in conjunction with an audio ASR system in order to improve the accuracy of the latter, especially under degraded acoustical conditions. Experimental results are presented for two small phoneme discrimination tasks, as well as a medium vocabulary isolated word recognition task. In all cases, performance of the combined system is superior to that of the audio system, with a reduction in errors ranging from 20 to 65%
Keywords :
feature extraction; hidden Markov models; image classification; speech recognition; LEAPS; Lipreading to Enhance Automatic Perception of Speech; accuracy; audiovisual system; automatic speech recognition; computer lipreading; degraded acoustical conditions; errors; hidden Markov models; medium vocabulary isolated word recognition task; performance; small phoneme discrimination tasks; speaker-dependent lipreading system; Audio-visual systems; Automatic speech recognition; Degradation; Hidden Markov models; Humans; Psychology; Robustness; Speech enhancement; Speech recognition; Vocabulary;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.536928
Filename :
536928
Link To Document :
بازگشت