• DocumentCode
    1190670
  • Title

    Density estimation by stochastic complexity

  • Author

    Rissanen, J. ; Speed, T.P. ; Yu, Bei

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • Volume
    38
  • Issue
    2
  • fYear
    1992
  • fDate
    3/1/1992 12:00:00 AM
  • Firstpage
    315
  • Lastpage
    323
  • Abstract
    The results by P. Hall and E.J. Hannan (1988) on optimization of histogram density estimators with equal bin widths by minimization of the stochastic complexity are extended and sharpened in two separate ways. As the first contribution, two generalized histogram estimators are constructed. The first has unequal bin widths which, together with the number of the bins, are determined by minimization of the stochastic complexity using dynamic programming. The other estimator consists of a mixture of equal bin width estimators, each of which is defined by the associated stochastic complexity. As the main contribution in the present work, two theorems are proved, which together extend the universal coding theorems to a large class of data generating densities. The first gives an asymptotic upper bound for the code redundancy in the order of magnitude, achieved with a special predictive type of histogram estimator, which sharpens a related bound. The second theorem states that this bound cannot be improved upon by any code whatsoever.<>
  • Keywords
    dynamic programming; encoding; estimation theory; information theory; minimisation; stochastic processes; asymptotic upper bound; code redundancy; data generating densities; density estimation; dynamic programming; equal bin widths; generalized histogram estimators; minimization; minimum description length principle; stochastic complexity; unequal bin widths; universal coding theorems; Codes; Density functional theory; Dynamic programming; Histograms; Kernel; Probability distribution; Statistics; Stochastic processes; Upper bound;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/18.119689
  • Filename
    119689