Title of article :
The minimum description length principle for probability density estimation by regular histograms
Author/Authors :
François Chapeau-Blondeau، نويسنده , , David Rousseau، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2009
Pages :
16
From page :
3969
To page :
3984
Abstract :
The minimum description length principle is a general methodology for statistical modeling and inference that selects the best explanation for observed data as the one allowing the shortest description of them. Application of this principle to the important task of probability density estimation by histograms was previously proposed. We review this approach and provide additional illustrative examples and an application to real-world data, with a presentation emphasizing intuition and concrete arguments. We also consider alternative ways of measuring the description lengths, that can be found to be more suited in this context. We explicitly exhibit, analyze and compare, the complete forms of the description lengths with formulas involving the information entropy and redundancy of the data, and not given elsewhere. Histogram estimation as performed here naturally extends to multidimensional data, and offers for them flexible and optimal subquantization schemes. The framework can be very useful for modeling and reduction of complexity of observed data, based on a general principle from statistical information theory, and placed within a unifying informational perspective.
Journal title :
Physica A Statistical Mechanics and its Applications
Serial Year :
2009
Journal title :
Physica A Statistical Mechanics and its Applications
Record number :
873299
Link To Document :
بازگشت