• DocumentCode
    1257496
  • Title

    PAC-Bayesian Inequalities for Martingales

  • Author

    Seldin, Yevgeny ; Laviolette, François ; Cesa-Bianchi, Nicoló ; Shawe-Taylor, John ; Auer, Peter

  • Author_Institution
    Max Planck Inst. for Intell. Syst., Tubingen, Germany
  • Volume
    58
  • Issue
    12
  • fYear
    2012
  • Firstpage
    7086
  • Lastpage
    7093
  • Abstract
    We present a set of high-probability inequalities that control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. Our results extend the PAC-Bayesian (probably approximately correct) analysis in learning theory from the i.i.d. setting to martingales opening the way for its application to importance weighted sampling, reinforcement learning, and other interactive learning domains, as well as many other domains in probability theory and statistics, where martingales are encountered. We also present a comparison inequality that bounds the expectation of a convex function of a martingale difference sequence shifted to the [0, 1] interval by the expectation of the same function of independent Bernoulli random variables. This inequality is applied to derive a tighter analog of Hoeffding-Azuma´s inequality.
  • Keywords
    Bayes methods; learning (artificial intelligence); probability; stochastic processes; Hoeffding-Azuma inequality; PAC-Bayesian inequalities; convex function; high-probability inequality; independent Bernoulli random variables; interactive learning domains; interdependent martingales; learning theory; martingale difference sequence; probability theory; probably approximately correct analysis; reinforcement learning; weighted sampling; Bayesian methods; Convex functions; Entropy; Learning systems; Random variables; Bernstein´s inequality; Hoeffding–Azuma´s inequality; PAC-Bayesian bounds; martingales;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2012.2211334
  • Filename
    6257492