• DocumentCode
    1317546
  • Title

    Online Learning of Noisy Data

  • Author

    Cesa-Bianchi, Nicoló ; Shalev-Shwartz, Shai ; Shamir, Ohad

  • Author_Institution
    Dipt. di Sci. dell´´Inf., Univ. degli Studi di Milano, Milan, Italy
  • Volume
    57
  • Issue
    12
  • fYear
    2011
  • Firstpage
    7907
  • Lastpage
    7931
  • Abstract
    We study online learning of linear and kernel-based predictors, when individual examples are corrupted by random noise, and both examples and noise type can be chosen adversarially and change over time. We begin with the setting where some auxiliary information on the noise distribution is provided, and we wish to learn predictors with respect to the squared loss. Depending on the auxiliary information, we show how one can learn linear and kernel-based predictors, using just 1 or 2 noisy copies of each example. We then turn to discuss a general setting where virtually nothing is known about the noise distribution, and one wishes to learn with respect to general losses and using linear and kernel-based predictors. We show how this can be achieved using a random, essentially constant number of noisy copies of each example. Allowing multiple copies cannot be avoided: Indeed, we show that the setting becomes impossible when only one noisy copy of each instance can be accessed. To obtain our results we introduce several novel techniques, some of which might be of independent interest.
  • Keywords
    data analysis; learning (artificial intelligence); kernel-based predictors; linear based predictors; machine learning; noisy data; online learning; Hilbert space; Noise measurement; Polynomials; Prediction algorithms;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2011.2164053
  • Filename
    6015553