DocumentCode
1190670
Title
Density estimation by stochastic complexity
Author
Rissanen, J. ; Speed, T.P. ; Yu, Bei
Author_Institution
IBM Almaden Res. Center, San Jose, CA, USA
Volume
38
Issue
2
fYear
1992
fDate
3/1/1992 12:00:00 AM
Firstpage
315
Lastpage
323
Abstract
The results by P. Hall and E.J. Hannan (1988) on optimization of histogram density estimators with equal bin widths by minimization of the stochastic complexity are extended and sharpened in two separate ways. As the first contribution, two generalized histogram estimators are constructed. The first has unequal bin widths which, together with the number of the bins, are determined by minimization of the stochastic complexity using dynamic programming. The other estimator consists of a mixture of equal bin width estimators, each of which is defined by the associated stochastic complexity. As the main contribution in the present work, two theorems are proved, which together extend the universal coding theorems to a large class of data generating densities. The first gives an asymptotic upper bound for the code redundancy in the order of magnitude, achieved with a special predictive type of histogram estimator, which sharpens a related bound. The second theorem states that this bound cannot be improved upon by any code whatsoever.<>
Keywords
dynamic programming; encoding; estimation theory; information theory; minimisation; stochastic processes; asymptotic upper bound; code redundancy; data generating densities; density estimation; dynamic programming; equal bin widths; generalized histogram estimators; minimization; minimum description length principle; stochastic complexity; unequal bin widths; universal coding theorems; Codes; Density functional theory; Dynamic programming; Histograms; Kernel; Probability distribution; Statistics; Stochastic processes; Upper bound;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/18.119689
Filename
119689
Link To Document