مرکز منطقه ای اطلاع رساني علوم و فناوري - Interior-Point Methods for Full-Information and Bandit Online Learning

DocumentCode :

1504797

Title :

Interior-Point Methods for Full-Information and Bandit Online Learning

Author :

Abernethy, Jacob D. ; Hazan, Elad ; Rakhlin, Alexander

Author_Institution :

Dept. of Comput. Sci., Univ. of California Berkeley, Berkeley, CA, USA

Volume :

Issue :

fYear :

2012

fDate :

7/1/2012 12:00:00 AM

Firstpage :

4164

Lastpage :

4175

Abstract :

We study the problem of predicting individual sequences with linear loss with full and partial (or bandit) feed- back. Our main contribution is the first efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal Õ(√(T)) regret. In addition, for the full-information setting, we give a novel regret minimization algorithm. These results are made possible by the introduction of interior-point methods for convex optimization to online learning.

Keywords :

convex programming; decision making; iterative methods; learning (artificial intelligence); bandit online learning; bandit setting; convex optimization; efήcient algorithm; full-information; full-information setting; individual sequences prediction problem; interior-point methods; online linear optimization; regret minimization algorithm; Convex functions; Decision making; Ellipsoids; Minimization; Newton method; Optimization; Vectors; Bandit feedback; interior-point methods; online convex optimization; online learning;

fLanguage :

English

Journal_Title :

Information Theory, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9448

Type :

jour

DOI :

10.1109/TIT.2012.2192096

Filename :

6191328

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1504797