مرکز منطقه ای اطلاع رساني علوم و فناوري - Minimum description length induction, Bayesianism, and Kolmogorov complexity

DocumentCode :

1305918

Title :

Minimum description length induction, Bayesianism, and Kolmogorov complexity

Author :

Vitányi, Paul M B ; Li, Ming

Author_Institution :

CWI, Amsterdam, Netherlands

Volume :

Issue :

fYear :

2000

fDate :

3/1/2000 12:00:00 AM

Firstpage :

446

Lastpage :

464

Abstract :

The relationship between the Bayesian approach and the minimum description length approach is established. We sharpen and clarify the general modeling principles minimum description length (MDL) and minimum message length (MML), abstracted as the ideal MDL principle and defined from Bayes´s rule by means of Kolmogorov complexity. The basic condition under which the ideal principle should be applied is encapsulated as the fundamental inequality, which in broad terms states that the principle is valid when the data are random, relative to every contemplated hypothesis and also these hypotheses are random relative to the (universal) prior. The ideal principle states that the prior probability associated with the hypothesis should be given by the algorithmic universal probability, and the sum of the log universal probability of the model plus the log of the probability of the data given the model should be minimized. If we restrict the model class to finite sets then application of the ideal principle turns into Kolmogorov´s minimal sufficient statistic. In general, we show that data compression is almost always the best strategy, both in model selection and prediction

Keywords :

computational complexity; data compression; probability; Bayes´s rule; Bayesianism; Kolmogorov complexity; Kolmogorov´s minimal sufficient statistic; algorithmic universal probability; fundamental inequality; hypothesis; ideal MDL principle; log universal probability; minimum description length induction; minimum message length; prior probability; Bayesian methods; Computer science; Data compression; Intersymbol interference; Machine learning; Predictive models; Probability; Statistical distributions; Statistics; Testing;

fLanguage :

English

Journal_Title :

Information Theory, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9448

Type :

jour

DOI :

10.1109/18.825807

Filename :

825807

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1305918