• DocumentCode
    86288
  • Title

    Tackling Speaking Mode Varieties in EMG-Based Speech Recognition

  • Author

    Wand, Michael ; Janke, Matthias ; Schultz, Tanja

  • Author_Institution
    Cognitive Syst. Lab., Karlsruhe Inst. of Technol., Karlsruhe, Germany
  • Volume
    61
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    2515
  • Lastpage
    2526
  • Abstract
    An electromyographic (EMG) silent speech recognizer is a system that recognizes speech by capturing the electric potentials of the human articulatory muscles, thus enabling the user to communicate silently. After having established a baseline EMG-based continuous speech recognizer, in this paper, we investigate speaking mode variations, i.e., discrepancies between audible and silent speech that deteriorate recognition accuracy. We introduce multimode systems that allow seamless switching between audible and silent speech, investigate different measures which quantify speaking mode differences, and present the spectral mapping algorithm, which improves the word error rate (WER) on silent speech by up to 14.3% relative. Our best average silent speech WER is 34.7%, and our best WER on audibly spoken speech is 16.8%.
  • Keywords
    electromyography; medical signal detection; medical signal processing; speech; speech recognition; EMG-Based speech recognition; EMG-based continuous speech recognizer; audible speech; electric potential capturing; electromyography; human articulatory muscles; multimode systems; silent speech; speaking mode variations; spectral mapping algorithm; word error rate; Electrodes; Electromyography; Hidden Markov models; Muscles; Speech; Speech processing; Speech recognition; EMG-based speech recognition; Electromyography (EMG); silent speech interfaces (SSI);
  • fLanguage
    English
  • Journal_Title
    Biomedical Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9294
  • Type

    jour

  • DOI
    10.1109/TBME.2014.2319000
  • Filename
    6802380