DocumentCode :
1371821
Title :
Maximum entropy language modeling and the smoothing problem
Author :
Martin, Stephen C. ; Ney, Hermann ; Hamacher, C.
Author_Institution :
Lehrstuhl fur Inf. VI, Tech. Hochschule Aachen
Volume :
8
Issue :
5
fYear :
2000
fDate :
9/1/2000 12:00:00 AM
Firstpage :
626
Lastpage :
632
Abstract :
This paper discusses various aspects of smoothing techniques in maximum entropy language modeling. This topic is typically not addressed in literature. The results can be summarized in four statements: 1) straightforward maximum entropy models with nested features, e.g., tri-, bi-, and uni-grams, result in unsmoothed relative frequencies models, 2) maximum entropy models with nested features and discounted feature counts approximate backing-off smoothed relative frequencies models with Kneser´s advanced marginal back-off distribution. This explains some of the reported success of maximum entropy models in the past. 3) We give perplexity results for nested and nonnested features, e.g., trigrams and distance-trigrams, on a 4 million word subset of the Wall Street Journal Corpus. From these results we conclude that the smoothing method has more effect on the perplexity than the method of how to combine the different types of features. 4) We show perplexity results for nonnested features using log-linear interpolation of conventionally smoothed language models, giving evidence that this approach may be a first step to overcome the smoothing problem in the context of maximum entropy
Keywords :
computational linguistics; interpolation; maximum entropy methods; smoothing methods; Kneser´s advanced marginal back-off distribution; backing-off smoothed relative frequencies models; bigram; discounted feature counts; log-linear interpolation; maximum entropy language modeling; maximum entropy models; nested feature; nonnested features; perplexity; smoothing problem; trigram; unigram; unsmoothed relative frequencies models; Context modeling; Entropy; Frequency estimation; Glass; History; Interpolation; Natural language processing; Parameter estimation; Smoothing methods;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.861385
Filename :
861385
Link To Document :
بازگشت