• DocumentCode
    875501
  • Title

    Multiexpert automatic speech recognition using acoustic and myoelectric signals

  • Author

    Chan, Adrian D C ; Englehart, Kevin B. ; Hudgins, Bernard ; Lovely, Dennis F.

  • Author_Institution
    Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont., Canada
  • Volume
    53
  • Issue
    4
  • fYear
    2006
  • fDate
    4/1/2006 12:00:00 AM
  • Firstpage
    676
  • Lastpage
    685
  • Abstract
    Classification accuracy of conventional automatic speech recognition (ASR) systems can decrease dramatically under acoustically noisy conditions. To improve classification accuracy and increase system robustness a multiexpert ASR system is implemented. In this system, acoustic speech information is supplemented with information from facial myoelectric signals (MES). A new method of combining experts, known as the plausibility method, is employed to combine an acoustic ASR expert and a MES ASR expert. The plausibility method of combining multiple experts, which is based on the mathematical framework of evidence theory, is compared to the Borda count and score-based methods of combination. Acoustic and facial MES data were collected from 5 subjects, using a 10-word vocabulary across an 18-dB range of acoustic noise. As expected the performance of an acoustic expert decreases with increasing acoustic noise; classification accuracies of the acoustic ASR expert are as low as 11.5%. The effect of noise is significantly reduced with the addition of the MES ASR expert. Classification accuracies remain above 78.8% across the 18-dB range of acoustic noise, when the plausibility method is used to combine the opinions of multiple experts. In addition, the plausibility method produced classification accuracies higher than any individual expert at all noise levels, as well as the highest classification accuracies, except at the 9-dB noise level. Using the Borda count and score-based multiexpert systems, classification accuracies are improved relative to the acoustic ASR expert but are as low as 51.5% and 59.5%, respectively.
  • Keywords
    acoustic noise; bioacoustics; electromyography; medical expert systems; medical signal processing; signal classification; speech recognition; stability; Borda count; acoustic noise; acoustic signals; classification accuracy; facial myoelectric signals; multiexpert automatic speech recognition; plausibility method; system robustness; Acoustic noise; Automatic speech recognition; Biomedical engineering; Hidden Markov models; Linear discriminant analysis; Niobium; Noise level; Robustness; Speech recognition; Vocabulary; Classification; electromyography; multiexpert system; myoelectric signals; speech recognition; Algorithms; Electromyography; Expert Systems; Facial Muscles; Feasibility Studies; Humans; Male; Muscle Contraction; Pattern Recognition, Automated; Sound Spectrography; Speech Intelligibility; Speech Perception; Speech Production Measurement; Speech Recognition Software;
  • fLanguage
    English
  • Journal_Title
    Biomedical Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9294
  • Type

    jour

  • DOI
    10.1109/TBME.2006.870224
  • Filename
    1608517