DocumentCode :
1434649
Title :
The minimum description length principle in coding and modeling
Author :
Barron, Andrew ; Rissanen, Jorma ; Yu, Bin
Author_Institution :
Dept. of Stat., Yale Univ., New Haven, CT, USA
Volume :
44
Issue :
6
fYear :
1998
fDate :
10/1/1998 12:00:00 AM
Firstpage :
2743
Lastpage :
2760
Abstract :
We review the principles of minimum description length and stochastic complexity as used in data compression and statistical modeling. Stochastic complexity is formulated as the solution to optimum universal coding problems extending Shannon´s basic source coding theorem. The normalized maximized likelihood, mixture, and predictive codings are each shown to achieve the stochastic complexity to within asymptotically vanishing terms. We assess the performance of the minimum description length criterion both from the vantage point of quality of data compression and accuracy of statistical inference. Context tree modeling, density estimation, and model selection in Gaussian linear regression serve as examples
Keywords :
computational complexity; data compression; information theory; maximum likelihood estimation; modelling; prediction theory; reviews; source coding; statistical analysis; stochastic processes; Gaussian linear regression; Shannon coding; context tree modeling; data compression; density estimation; minimum description length principle; mixture coding; model selection; normalized maximized likelihood coding; optimum universal coding problem; predictive coding; review; source coding theorem; statistical inference; statistical modeling; stochastic complexity; Context modeling; Data compression; Linear regression; Maximum likelihood decoding; Maximum likelihood estimation; Predictive coding; Probability distribution; Source coding; Statistical distributions; Stochastic processes;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/18.720554
Filename :
720554
Link To Document :
بازگشت