DocumentCode :
2246254
Title :
The effect of visual information on word initial consonant perception of dysarthric speech
Author :
Schumeyer, Richard P. ; Barner, Kenneth E.
Author_Institution :
Appl. Sci. & Eng. Labs., Delaware Univ./A.I. duPont Inst., Wilmington, DE, USA
Volume :
1
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
46
Abstract :
Disabled individuals can realize many benefits from automatic speech recognition (ASR). To date, most ASR research has focused on normal speech. However, many individuals with physical disabilities also exhibit speech disorders. While limited research has been conducted focusing on dysarthric speech recognition, the preliminary results indicate that additional study is necessary. Recently, increasing attention has been given to multimodal speech recognition schemes that utilize multiple input sources-most commonly audio and video. This multimodal approach has been applied to normal speech with demonstrated effectiveness. Through studying the effect of audio and visual information in a human perception experiment, this study attempts to discover whether such an approach would be useful for dysarthric speech recognition. Results of a closed-vocabulary perception test are presented. In this test, 15 normal-hearing viewers were presented with video-tapes of three dysarthric speakers (with cerebral palsy) speaking a series of one-syllable nonsense words. These words differed only in the initial consonant. The words were presented in both audio-only and audio-visual modes. Perception rates in both modes were measured. The results are analyzed and compared to other studies of visual speech perception and dysarthric speech articulation
Keywords :
audio-visual systems; handicapped aids; hearing; human factors; speech intelligibility; speech recognition; visual perception; vocabulary; audio information; cerebral palsy; closed-vocabulary perception test; disabled individuals; dysarthric speech articulation; dysarthric speech recognition; multimodal speech recognition schemes; multiple input sources; one-syllable nonsense words; perception rates; speech disorders; video-tapes; visual information; visual speech perception; word initial consonant perception; Auditory system; Automatic speech recognition; Humans; Laboratories; Lamps; Speech analysis; Speech recognition; Telephony; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607021
Filename :
607021
Link To Document :
بازگشت