Online optimization in parametric dynamic environments

Author

Hall, Eric C. ; Willett, Rebecca

Author_Institution

Electr. & Comput. Eng., Duke Univ., Durham, NC, USA

fYear

2013

fDate

2-4 Oct. 2013

Firstpage

33

Lastpage

38

Abstract

This paper describes a novel approach to online convex programming in dynamic settings. Many existing online learning methods are characterized via regret bounds that quantify the gap between the performance of the online algorithm relative to a comparator. In previous work, this comparator was either considered static over time, or admitted sublinear tracking regret bounds only when the comparator was piecewise constant or slowly varying. In contrast, the proposed method determines the best dynamical model from a parametric family to incorporate into the prediction process, and admits regret bounds which scale with the deviation of a comparator from that dynamical model. In other words, this approach can yield low regret relative to highly variable, dynamic comparators. This result is proved for loss functions corresponding to the negative log likelihood associated with an exponential family probability distribution, and several properties of the exponential family are exploited to ensure low regret.

Keywords

convex programming; dynamic programming; statistical distributions; comparator deviation; exponential family probability distribution; loss functions; negative log likelihood; online convex programming; online learning methods; online optimization; parametric dynamic model; prediction process; regret bounds; variable dynamic comparators; Adaptation models; Additives; Computational modeling; Heuristic algorithms; Mirrors; Optimization; Predictive models; Dynamical Models; Online Optimization; Regularization; Sequential Prediction; Stochastic Filtering; Tracking Regret;

fLanguage

English

Publisher

ieee

Conference_Titel

Communication, Control, and Computing (Allerton), 2013 51st Annual Allerton Conference on

Conference_Location

Monticello, IL

Print_ISBN

978-1-4799-3409-6

Type

conf

DOI

10.1109/Allerton.2013.6736502

Filename

6736502