• DocumentCode
    2699630
  • Title

    Optimization of dysarthric speech recognition

  • Author

    Chen, Fangxin ; Kostov, Aleksandar

  • Author_Institution
    Dept. of Linguistics, Alberta Univ., Edmonton, Alta., Canada
  • Volume
    4
  • fYear
    1997
  • fDate
    30 Oct-2 Nov 1997
  • Firstpage
    1436
  • Abstract
    Explores the residual vocal ability of people who have severe motor impairments accompanied with severe dysarthria, and develops methods for improving the performance of automatic speech recognition (ASR) of dysarthric speech. The target applications for this technology are in the development of communication and control devices for these people. In our speech recognition system, we developed an adaptive word detection algorithm to detect words in highly irregular dysarthric speech. We also implemented perceptually-based mel frequency cepstrum coefficients (MFCC) for the parametric representation of the speech signal, and we adopted the left-to-right discrete hidden Markov model (DHMM) for speech pattern recognition. The system was tested with one person who has cerebral palsy and dysarthria, reducing the intelligibility of her speech to less than 15%. Our initial results on a word set consisting of ten digits demonstrated that recognition rates above 90% can be achieved if more than ten repetitions are used for training
  • Keywords
    adaptive systems; cepstral analysis; handicapped aids; hidden Markov models; optimisation; speech intelligibility; speech recognition; 10-digit word set; adaptive word detection algorithm; automatic speech recognition performance optimization; cerebral palsy; communication devices; communication disorders; control devices; dysarthric speech recognition; irregular dysarthric speech; left-to-right discrete hidden Markov model; motor impairments; perceptually-based mel frequency cepstrum coefficients; recognition rates; repetitions; residual vocal ability; speech intelligibility; speech signal parametric representation; training; Automatic control; Automatic speech recognition; Cepstrum; Communication system control; Detection algorithms; Hidden Markov models; Mel frequency cepstral coefficient; Pattern recognition; Speech recognition; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society, 1997. Proceedings of the 19th Annual International Conference of the IEEE
  • Conference_Location
    Chicago, IL
  • ISSN
    1094-687X
  • Print_ISBN
    0-7803-4262-3
  • Type

    conf

  • DOI
    10.1109/IEMBS.1997.756975
  • Filename
    756975