DocumentCode
31026
Title
The Generalization Ability of Online Algorithms for Dependent Data
Author
Agarwal, Alekh ; Duchi, John C.
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Univ. of California, Berkeley, Berkeley, CA, USA
Volume
59
Issue
1
fYear
2013
fDate
Jan. 2013
Firstpage
573
Lastpage
587
Abstract
We study the generalization performance of online learning algorithms trained on samples coming from a dependent source of data. We show that the generalization error of any stable online algorithm concentrates around its regret-an easily computable statistic of the online performance of the algorithm-when the underlying ergodic process is β- or φ -mixing. We show high-probability error bounds assuming the loss function is convex, and we also establish sharp convergence rates and deviation bounds for strongly convex losses and several linear prediction problems such as linear and logistic regression, least-squares SVM, and boosting on dependent data. In addition, our results have straightforward applications to stochastic optimization with dependent data, and our analysis requires only martingale convergence arguments; we need not rely on more powerful statistical tools such as empirical process theory.
Keywords
regression analysis; support vector machines; β-mixing; φ-mixing; computable statistic; dependent data; deviation bounds; empirical process theory; ergodic process; generalization ability; generalization error; high-probability error bounds; least-squares SVM; linear prediction problems; linear regression; logistic regression; online algorithms; online learning algorithms; sharp convergence rates; stable online algorithm; statistical tools; stochastic optimization; strongly convex losses; Algorithm design and analysis; Convergence; Loss measurement; Markov processes; Prediction algorithms; Probability; Dependent observations; generalization bounds; linear prediction; online learning; statistical learning theory;
fLanguage
English
Journal_Title
Information Theory, IEEE Transactions on
Publisher
ieee
ISSN
0018-9448
Type
jour
DOI
10.1109/TIT.2012.2212414
Filename
6263295
Link To Document