• DocumentCode
    2176598
  • Title

    Adapting acoustic and lexical models to dysarthric speech

  • Author

    Mengistu, Kinfe Tadesse ; Rudzicz, Frank

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4924
  • Lastpage
    4927
  • Abstract
    Dysarthria is a motor speech disorder resulting from neurological damage to the part of the brain that controls the physical production of speech. It is, in part, characterized by pronunciation errors that include deletions, substitutions, insertions, and distortions of phonemes. These errors follow consistent intra-speaker patterns that we exploit through acoustic and lexical model adaptation to improve automatic speech recognition (ASR) on dysarthric speech. We show that acoustic model adaptation yields an average relative word error rate (WER) reduction of 36.99% and that pronunciation lexicon adaptation (PLA) further reduces the relative WER by an average of 8.29% on a large vocabulary task of over 1500 words for six speakers with severe to moderate dysarthria. PLA also shows an average relative WER reduction of 7.11% on speaker-dependent models evaluated using 5-fold cross-validation.
  • Keywords
    speech recognition; ASR; PLA; WER reduction; adapting acoustic; automatic speech recognition; dysarthric speech; intra-speaker patterns; lexical models; motor speech disorder; word error rate reduction; Acoustics; Adaptation models; Data models; Databases; Hidden Markov models; Speech; Speech recognition; dysarthria; dysarthric speech; pronunciation lexicon adaptation; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947460
  • Filename
    5947460