Title :
Smoothed language model incorporation for efficient time-synchronous beam search decoding in LVCSR
Author :
Willett, Daniel ; McDermott, Erik ; Katagiri, Shigem
Author_Institution :
NTT Commun. Sci. Labs., Kyoto, Japan
Abstract :
For performing the decoding search in large vocabulary continuous speech recognition (LVCSR) with hidden Markov models (HMM) and statistical language models, the most straightforward and popular approach is the time-synchronous beam search procedure. A drawback of this approach is that the time-asynchrony of the language model weight application during search leads to performance degradations. This is particularly so when performing the search with a tight pruning beam. This study presents a method for smoothing the language model within the recognition network. The optimization goal is the smearing of transition probabilities from HMM state to HMM state in favor of a more time-synchronous language model weight application. In addition, state-based language model look-ahead is proposed and evaluated. Both language model smoothing techniques lead to a remarkable improvement in accuracy-to-run-time ratio, while their combined application yields only limited improvements.
Keywords :
hidden Markov models; optimisation; probability; speech recognition; vocabulary; HMM; LVCSR; decoding search; hidden Markov models; language model smoothing; large vocabulary continuous speech recognition; optimization; recognition network; state-based language model look-ahead; statistical language models; time-synchronous language model weight; transition probability smearing; Acoustic beams; Context modeling; Decoding; Degradation; Hidden Markov models; Laboratories; Natural languages; Smoothing methods; Speech recognition; Viterbi algorithm;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034616