• DocumentCode
    1085744
  • Title

    On the generalization ability of on-line learning algorithms

  • Author

    Cesa-Bianchi, Nicolò ; Conconi, Alex ; Gentile, Claudio

  • Author_Institution
    Univ. of Milan, Italy
  • Volume
    50
  • Issue
    9
  • fYear
    2004
  • Firstpage
    2050
  • Lastpage
    2057
  • Abstract
    In this paper, it is shown how to extract a hypothesis with small risk from the ensemble of hypotheses generated by an arbitrary on-line learning algorithm run on an independent and identically distributed (i.i.d.) sample of data. Using a simple large deviation argument, we prove tight data-dependent bounds for the risk of this hypothesis in terms of an easily computable statistic Mn associated with the on-line performance of the ensemble. Via sharp pointwise bounds on Mn, we then obtain risk tail bounds for kernel perceptron algorithms in terms of the spectrum of the empirical kernel matrix. These bounds reveal that the linear hypotheses found via our approach achieve optimal tradeoffs between hinge loss and margin size over the class of all linear functions, an issue that was left open by previous results. A distinctive feature of our approach is that the key tools for our analysis come from the model of prediction of individual sequences; i.e., a model making no probabilistic assumptions on the source generating the data. In fact, these tools turn out to be so powerful that we only need very elementary statistical facts to obtain our final risk bounds.
  • Keywords
    generalisation (artificial intelligence); learning (artificial intelligence); pattern recognition; perceptrons; empirical kernel matrix; independent identically distributed data; kernel perceptron algorithms; linear hypotheses; on-line learning algorithms; pattern recognition; Data mining; Fasteners; Information processing; Kernel; Pattern recognition; Predictive models; Random variables; Statistical distributions; Statistical learning; Tail; Kernel functions; on-line learning; pattern recognition; perceptron algorithm; statistical learning theory;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2004.833339
  • Filename
    1327806