DocumentCode
2176598
Title
Adapting acoustic and lexical models to dysarthric speech
Author
Mengistu, Kinfe Tadesse ; Rudzicz, Frank
Author_Institution
Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
fYear
2011
fDate
22-27 May 2011
Firstpage
4924
Lastpage
4927
Abstract
Dysarthria is a motor speech disorder resulting from neurological damage to the part of the brain that controls the physical production of speech. It is, in part, characterized by pronunciation errors that include deletions, substitutions, insertions, and distortions of phonemes. These errors follow consistent intra-speaker patterns that we exploit through acoustic and lexical model adaptation to improve automatic speech recognition (ASR) on dysarthric speech. We show that acoustic model adaptation yields an average relative word error rate (WER) reduction of 36.99% and that pronunciation lexicon adaptation (PLA) further reduces the relative WER by an average of 8.29% on a large vocabulary task of over 1500 words for six speakers with severe to moderate dysarthria. PLA also shows an average relative WER reduction of 7.11% on speaker-dependent models evaluated using 5-fold cross-validation.
Keywords
speech recognition; ASR; PLA; WER reduction; adapting acoustic; automatic speech recognition; dysarthric speech; intra-speaker patterns; lexical models; motor speech disorder; word error rate reduction; Acoustics; Adaptation models; Data models; Databases; Hidden Markov models; Speech; Speech recognition; dysarthria; dysarthric speech; pronunciation lexicon adaptation; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947460
Filename
5947460
Link To Document