• DocumentCode
    13395
  • Title

    Turbo Processing for Speech Recognition

  • Author

    Moon, Todd K. ; Gunther, Jacob H. ; Broadus, Cortnie ; Hou, Wenhao ; Nelson, N.

  • Author_Institution
    Inf. Dynamics Lab. & the Electr. & Comput. Eng. Dept., Utah State Univ., Logan, UT, USA
  • Volume
    44
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    83
  • Lastpage
    91
  • Abstract
    Speech recognition is a classic example of a human/machine interface, typifying many of the difficulties and opportunities of human/machine interaction. In this paper, speech recognition is used as an example of applying turbo processing principles to the general problem of human/machine interface. Speech recognizers frequently involve a model representing phonemic information at a local level, followed by a language model representing information at a nonlocal level. This structure is analogous to the local (e.g., equalizer) and nonlocal (e.g., error correction decoding) elements common in digital communications. Drawing from the analogy of turbo processing for digital communications, turbo speech processing iteratively feeds back the output of the language model to be used as prior probabilities for the phonemic model. This analogy is developed here, and the performance of this turbo model is characterized by using an artificial language model. Using turbo processing, the relative error rate improves significantly, especially in high-noise settings.
  • Keywords
    human computer interaction; speech recognition; artificial language model; digital communications; human-machine interaction; human-machine interface; language model; phonemic model; speech recognition; turbo speech processing; Human–machine interface; speech processing; turbo processing;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2013.2247593
  • Filename
    6495711