مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-armed bandits with switching penalties

DocumentCode :

766995

Title :

Multi-armed bandits with switching penalties

Author :

Asawa, Manjari ; Teneketzis, Demosthenis

Author_Institution :

Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA

Volume :

Issue :

fYear :

1996

fDate :

3/1/1996 12:00:00 AM

Firstpage :

328

Lastpage :

348

Abstract :

The multi-armed bandit problem with switching penalties (switching cost and switching delays) is investigated. It is shown that under an optimal policy, decisions about the processor allocation need to be made only at stopping times that achieve an appropriate index, the well-known “Gittins index” or a “switching index” that is defined for switching cost and switching delays. An algorithm for the computation of the “switching index” is presented. Furthermore, sufficient conditions for optimality of allocation strategies, based on limited look-ahead techniques, are established. These conditions together with the above-mentioned feature of optimal scheduling policies simplify the search for an optimal allocation policy. For a special class of multi-armed bandits (scheduling of parallel queues with switching penalties and no arrivals), it is shown that the aforementioned property of optimal policies is sufficient to determine an optimal allocation strategy. In general, the determination of optimal allocation policies remains a difficult and challenging task

Keywords :

game theory; optimisation; queueing theory; resource allocation; Gittins index; multi-armed bandits; optimal policy; optimal scheduling policies; parallel queues; processor allocation; stopping times; switching cost; switching delays; switching penalties; Cost function; Delay; Infinite horizon; Job shop scheduling; Manufacturing; Optimal scheduling; Processor scheduling; Resource management; Stochastic processes; Sufficient conditions;

fLanguage :

English

Journal_Title :

Automatic Control, IEEE Transactions on

Publisher :

ieee

ISSN :

0018-9286

Type :

jour

DOI :

10.1109/9.486316

Filename :

486316

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=766995