DocumentCode :
2176598
Title :
Adapting acoustic and lexical models to dysarthric speech
Author :
Mengistu, Kinfe Tadesse ; Rudzicz, Frank
Author_Institution :
Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4924
Lastpage :
4927
Abstract :
Dysarthria is a motor speech disorder resulting from neurological damage to the part of the brain that controls the physical production of speech. It is, in part, characterized by pronunciation errors that include deletions, substitutions, insertions, and distortions of phonemes. These errors follow consistent intra-speaker patterns that we exploit through acoustic and lexical model adaptation to improve automatic speech recognition (ASR) on dysarthric speech. We show that acoustic model adaptation yields an average relative word error rate (WER) reduction of 36.99% and that pronunciation lexicon adaptation (PLA) further reduces the relative WER by an average of 8.29% on a large vocabulary task of over 1500 words for six speakers with severe to moderate dysarthria. PLA also shows an average relative WER reduction of 7.11% on speaker-dependent models evaluated using 5-fold cross-validation.
Keywords :
speech recognition; ASR; PLA; WER reduction; adapting acoustic; automatic speech recognition; dysarthric speech; intra-speaker patterns; lexical models; motor speech disorder; word error rate reduction; Acoustics; Adaptation models; Data models; Databases; Hidden Markov models; Speech; Speech recognition; dysarthria; dysarthric speech; pronunciation lexicon adaptation; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947460
Filename :
5947460
Link To Document :
بازگشت