DocumentCode
1157844
Title
Branching bandits and Klimov´s problem: achievable region and side constraints
Author
Bertsimas, Dimitris ; Paschalidis, Ioannis Ch ; Tsitsiklis, John N.
Author_Institution
Lab. for Inf. & Decision Syst., MIT, Cambridge, MA, USA
Volume
40
Issue
12
fYear
1995
fDate
12/1/1995 12:00:00 AM
Firstpage
2063
Lastpage
2075
Abstract
We consider the average cost branching bandits problem and its special case known as Klimov´s problem. We consider the vector n whose components are the mean number of bandits (or customers) of each type that are present. We characterize fully the achievable region, that is, the set of all possible vectors n that can be obtained by considering all possible policies. While the original description of the achievable region involves exponentially many constraints, we also develop an alternative description that involves only O(R2) variables and constraints, where R is the number of bandit types (or customer classes). We then consider the problem of minimizing a linear function of n subject to L additional linear constraints on n. We show that optimal policies can be obtained by randomizing between L+1 strict priority policies that can be found efficiently (in polynomial time) using linear programming techniques
Keywords
computational complexity; constraint theory; linear programming; operations research; queueing theory; Klimov´s problem; M/GI/I queue; achievable region; average cost; branching bandits problem; constraints; linear function; linear programming; polynomial time; side constraints; single server multiclass queue; Costs; Feedback; Linear programming; Manufacturing processes; Network servers; Operations research; Polynomials; Routing; Student members; Vectors;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/9.478231
Filename
478231
Link To Document