Turbo Processing for Speech Recognition

Author

Moon, Todd K. ; Gunther, Jacob H. ; Broadus, Cortnie ; Hou, Wenhao ; Nelson, N.

Author_Institution

Inf. Dynamics Lab. & the Electr. & Comput. Eng. Dept., Utah State Univ., Logan, UT, USA

Volume

44

Issue

1

fYear

2014

fDate

Jan. 2014

Firstpage

83

Lastpage

91

Abstract

Speech recognition is a classic example of a human/machine interface, typifying many of the difficulties and opportunities of human/machine interaction. In this paper, speech recognition is used as an example of applying turbo processing principles to the general problem of human/machine interface. Speech recognizers frequently involve a model representing phonemic information at a local level, followed by a language model representing information at a nonlocal level. This structure is analogous to the local (e.g., equalizer) and nonlocal (e.g., error correction decoding) elements common in digital communications. Drawing from the analogy of turbo processing for digital communications, turbo speech processing iteratively feeds back the output of the language model to be used as prior probabilities for the phonemic model. This analogy is developed here, and the performance of this turbo model is characterized by using an artificial language model. Using turbo processing, the relative error rate improves significantly, especially in high-noise settings.

Keywords

human computer interaction; speech recognition; artificial language model; digital communications; human-machine interaction; human-machine interface; language model; phonemic model; speech recognition; turbo speech processing; Human–machine interface; speech processing; turbo processing;

fLanguage

English

Journal_Title

Cybernetics, IEEE Transactions on

Publisher

ieee

ISSN

2168-2267

Type

jour

DOI

10.1109/TCYB.2013.2247593

Filename

6495711