Title :
Exploiting both local and global constraints for multi-span statistical language modeling
Author :
Bellegarda, Jerome R.
Author_Institution :
Spoken Language Group, Apple Comput. Inc., Cupertino, CA, USA
Abstract :
A new framework is proposed to integrate the various constraints, both local and global, that are present in the language. Local constraints are captured via n-gram language modeling, while global constraints are taken into account through the use of latent semantic analysis. An integrative formulation is derived for the combination of these two paradigms, resulting in several families of multi-span language models for large vocabulary speech recognition. Because of the inherent complementarity in the two types of constraints, the performance of the integrated language models, as measured by the perplexity, compares favorably with the corresponding n-gram performance
Keywords :
grammars; natural languages; smoothing methods; speech processing; speech recognition; statistical analysis; global constraints; integrated language models; large vocabulary speech recognition; latent semantic analysis; local constraints; multi-span statistical language modeling; n-gram language modeling; n-gram performance; perplexity; smoothing; Displays; History; Natural languages; Power system modeling; Predictive models; Robustness; Smoothing methods; Speech recognition; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675355