Smoothed language model incorporation for efficient time-synchronous beam search decoding in LVCSR

Author

Willett, Daniel ; McDermott, Erik ; Katagiri, Shigem

Author_Institution

NTT Commun. Sci. Labs., Kyoto, Japan

fYear

2001

fDate

2001

Firstpage

178

Lastpage

181

Abstract

For performing the decoding search in large vocabulary continuous speech recognition (LVCSR) with hidden Markov models (HMM) and statistical language models, the most straightforward and popular approach is the time-synchronous beam search procedure. A drawback of this approach is that the time-asynchrony of the language model weight application during search leads to performance degradations. This is particularly so when performing the search with a tight pruning beam. This study presents a method for smoothing the language model within the recognition network. The optimization goal is the smearing of transition probabilities from HMM state to HMM state in favor of a more time-synchronous language model weight application. In addition, state-based language model look-ahead is proposed and evaluated. Both language model smoothing techniques lead to a remarkable improvement in accuracy-to-run-time ratio, while their combined application yields only limited improvements.

Keywords

hidden Markov models; optimisation; probability; speech recognition; vocabulary; HMM; LVCSR; decoding search; hidden Markov models; language model smoothing; large vocabulary continuous speech recognition; optimization; recognition network; state-based language model look-ahead; statistical language models; time-synchronous language model weight; transition probability smearing; Acoustic beams; Context modeling; Decoding; Degradation; Hidden Markov models; Laboratories; Natural languages; Smoothing methods; Speech recognition; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on

Print_ISBN

0-7803-7343-X

Type

conf

DOI

10.1109/ASRU.2001.1034616

Filename

1034616