DocumentCode :
2019589
Title :
Error Bounds and Improved Probability Estimation using the Maximum Likelihood Set
Author :
Karakos, D. ; Khudanpur, S.
Author_Institution :
Johns Hopkins Univ., Baltimore
fYear :
2007
fDate :
24-29 June 2007
Firstpage :
1851
Lastpage :
1855
Abstract :
The maximum likelihood set (MLS) is a novel candidate for nonparametric probability estimation from small samples that permits incorporating prior or structural knowledge into the estimator. It is a set of probability distributions which assign to the observed type (or empirical distribution) a likelihood that is no lower than the likelihood they assign to any other type. The MLS has been shown to have many highly desirable properties, including strong consistency of MLS-based estimates; yet the probability that the MLS contains the data-generating distribution may be arbitrarily small. In this paper, we propose to overcome this shortcoming via an epsiv-fattening of the MLS. The proposed set, called the High Likelihood Set (HLS), with epsiv rarr 0 slowly in sample size, ensures that the HLS contains the data- generating distribution with arbitrarily large probability, while retaining most desirable properties of the MLS. In particular, the HLS provides a "high-probability" bound on the estimation error, and experimental results in statistical language modeling show improved operational performance from HLS-based estimates.
Keywords :
error statistics; maximum likelihood estimation; set theory; statistical distributions; data-generating distribution; error bound; high likelihood set; improved probability estimation; maximum likelihood set; nonparametric probability estimation; probability distribution; Bayesian methods; Entropy; Estimation error; High level synthesis; Maximum likelihood estimation; Multilevel systems; Natural languages; Probability distribution; Smoothing methods; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory, 2007. ISIT 2007. IEEE International Symposium on
Conference_Location :
Nice
Print_ISBN :
978-1-4244-1397-3
Type :
conf
DOI :
10.1109/ISIT.2007.4557150
Filename :
4557150
Link To Document :
بازگشت