DocumentCode
3246755
Title
Online optimization in parametric dynamic environments
Author
Hall, Eric C. ; Willett, Rebecca
Author_Institution
Electr. & Comput. Eng., Duke Univ., Durham, NC, USA
fYear
2013
fDate
2-4 Oct. 2013
Firstpage
33
Lastpage
38
Abstract
This paper describes a novel approach to online convex programming in dynamic settings. Many existing online learning methods are characterized via regret bounds that quantify the gap between the performance of the online algorithm relative to a comparator. In previous work, this comparator was either considered static over time, or admitted sublinear tracking regret bounds only when the comparator was piecewise constant or slowly varying. In contrast, the proposed method determines the best dynamical model from a parametric family to incorporate into the prediction process, and admits regret bounds which scale with the deviation of a comparator from that dynamical model. In other words, this approach can yield low regret relative to highly variable, dynamic comparators. This result is proved for loss functions corresponding to the negative log likelihood associated with an exponential family probability distribution, and several properties of the exponential family are exploited to ensure low regret.
Keywords
convex programming; dynamic programming; statistical distributions; comparator deviation; exponential family probability distribution; loss functions; negative log likelihood; online convex programming; online learning methods; online optimization; parametric dynamic model; prediction process; regret bounds; variable dynamic comparators; Adaptation models; Additives; Computational modeling; Heuristic algorithms; Mirrors; Optimization; Predictive models; Dynamical Models; Online Optimization; Regularization; Sequential Prediction; Stochastic Filtering; Tracking Regret;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication, Control, and Computing (Allerton), 2013 51st Annual Allerton Conference on
Conference_Location
Monticello, IL
Print_ISBN
978-1-4799-3409-6
Type
conf
DOI
10.1109/Allerton.2013.6736502
Filename
6736502
Link To Document