Title :
On-line algorithms for combining language models
Author :
Kalai, Adam ; Chen, Stanley ; Blum, Avrim ; Rosenfeld, Ronald
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Multiple language models are combined for many tasks in language modeling, such as domain and topic adaptation. In this work, we compare on-line algorithms from machine learning to existing algorithms for combining language models. On-line algorithms developed for this problem have parameters that are updated dynamically to adapt to a data set during evaluation. On-line analysis provides guarantees that these algorithms will perform nearly as well as the best model chosen in hindsight from a large class of models, e.g., the set of all static mixtures. We describe several on-line algorithms and present results comparing these techniques with existing language modeling combination methods on the task of domain adaptation. We demonstrate that, in some situations, on-line techniques can significantly outperform static mixtures (by over 10% in terms of perplexity) and are especially effective when the nature of the test data is unknown or changes over time
Keywords :
adaptive systems; learning (artificial intelligence); natural languages; online operation; speech processing; adaptive techniques; data set; domain adaptation; language modeling combination methods; machine learning; multiple language models; on-line algorithms; on-line analysis; perplexity; static mixtures; test data; topic adaptation; Adaptation model; Algorithm design and analysis; Broadcasting; Computer science; Hidden Markov models; Machine learning; Machine learning algorithms; Performance analysis; Switches; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.759774