DocumentCode
13395
Title
Turbo Processing for Speech Recognition
Author
Moon, Todd K. ; Gunther, Jacob H. ; Broadus, Cortnie ; Hou, Wenhao ; Nelson, N.
Author_Institution
Inf. Dynamics Lab. & the Electr. & Comput. Eng. Dept., Utah State Univ., Logan, UT, USA
Volume
44
Issue
1
fYear
2014
fDate
Jan. 2014
Firstpage
83
Lastpage
91
Abstract
Speech recognition is a classic example of a human/machine interface, typifying many of the difficulties and opportunities of human/machine interaction. In this paper, speech recognition is used as an example of applying turbo processing principles to the general problem of human/machine interface. Speech recognizers frequently involve a model representing phonemic information at a local level, followed by a language model representing information at a nonlocal level. This structure is analogous to the local (e.g., equalizer) and nonlocal (e.g., error correction decoding) elements common in digital communications. Drawing from the analogy of turbo processing for digital communications, turbo speech processing iteratively feeds back the output of the language model to be used as prior probabilities for the phonemic model. This analogy is developed here, and the performance of this turbo model is characterized by using an artificial language model. Using turbo processing, the relative error rate improves significantly, especially in high-noise settings.
Keywords
human computer interaction; speech recognition; artificial language model; digital communications; human-machine interaction; human-machine interface; language model; phonemic model; speech recognition; turbo speech processing; Human–machine interface; speech processing; turbo processing;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TCYB.2013.2247593
Filename
6495711
Link To Document