DocumentCode :
2979436
Title :
A statistical language modeling approach integrating local and global constraints
Author :
Bellegarda, Jerome R.
Author_Institution :
Spoken Language Group, Apple Comput. Inc., Cupertino, CA, USA
fYear :
1997
fDate :
14-17 Dec 1997
Firstpage :
262
Lastpage :
269
Abstract :
A new framework is proposed to integrate the various constraints, both local and global, that are present in language. Local constraints are captured via n-gram language modeling, while global constraints are taken into account through the use of latent semantic analysis. An integrative formulation is derived for the combination of these two paradigms, resulting in several families of multi-span language models for large-vocabulary speech recognition. Because of the inherent complementarity in the two types of constraints, the performance of the integrated language models, as measured by perplexity, compares favorably with the corresponding n-gram performance
Keywords :
constraint theory; modelling; natural languages; nomograms; performance index; speech recognition; statistics; vocabulary; complementarity; global constraints; integrated language models; large-vocabulary speech recognition; latent semantic analysis; local constraints; multi-span language models; n-gram language modeling; performance; perplexity; statistical language modeling; Data mining; Databases; Displays; Frequency; Natural languages; Power measurement; Power system modeling; Predictive models; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
Type :
conf
DOI :
10.1109/ASRU.1997.659014
Filename :
659014
Link To Document :
بازگشت