• DocumentCode
    858865
  • Title

    Context tree estimation for not necessarily finite memory processes, via BIC and MDL

  • Author

    Csiszár, Imre ; Talata, Zsolt

  • Author_Institution
    Stochastics Res. Group, Hungarian Acad. of Sci., Budapest
  • Volume
    52
  • Issue
    3
  • fYear
    2006
  • fDate
    3/1/2006 12:00:00 AM
  • Firstpage
    1007
  • Lastpage
    1016
  • Abstract
    The concept of context tree, usually defined for finite memory processes, is extended to arbitrary stationary ergodic processes (with finite alphabet). These context trees are not necessarily complete, and may be of infinite depth. The familiar Bayesian information criterion (BIC) and minimum description length (MDL) principles are shown to provide strongly consistent estimators of the context tree, via optimization of a criterion for hypothetical context trees of finite depth, allowed to grow with the sample size n as o(logn). Algorithms are provided to compute these estimators in O(n) time, and to compute them on-line for all i les n in o(nlogn) time
  • Keywords
    Bayes methods; estimation theory; information theory; trees (mathematics); BIC; Bayesian information criterion; MDL principle; arbitrary stationary ergodic processes; finite memory processes; hypothetical context tree estimation; minimum description length; Australia; Bayesian methods; Context modeling; Information theory; Mathematics; Pattern recognition; Probability; Statistical analysis; Statistical learning; Stochastic processes; Bayesian information criterion (BIC); consistent estimation; context tree; context tree maximization (CTM); infinite memory; minimum description length (MDL); model selection;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2005.864431
  • Filename
    1603768