• DocumentCode
    1157844
  • Title

    Branching bandits and Klimov´s problem: achievable region and side constraints

  • Author

    Bertsimas, Dimitris ; Paschalidis, Ioannis Ch ; Tsitsiklis, John N.

  • Author_Institution
    Lab. for Inf. & Decision Syst., MIT, Cambridge, MA, USA
  • Volume
    40
  • Issue
    12
  • fYear
    1995
  • fDate
    12/1/1995 12:00:00 AM
  • Firstpage
    2063
  • Lastpage
    2075
  • Abstract
    We consider the average cost branching bandits problem and its special case known as Klimov´s problem. We consider the vector n whose components are the mean number of bandits (or customers) of each type that are present. We characterize fully the achievable region, that is, the set of all possible vectors n that can be obtained by considering all possible policies. While the original description of the achievable region involves exponentially many constraints, we also develop an alternative description that involves only O(R2) variables and constraints, where R is the number of bandit types (or customer classes). We then consider the problem of minimizing a linear function of n subject to L additional linear constraints on n. We show that optimal policies can be obtained by randomizing between L+1 strict priority policies that can be found efficiently (in polynomial time) using linear programming techniques
  • Keywords
    computational complexity; constraint theory; linear programming; operations research; queueing theory; Klimov´s problem; M/GI/I queue; achievable region; average cost; branching bandits problem; constraints; linear function; linear programming; polynomial time; side constraints; single server multiclass queue; Costs; Feedback; Linear programming; Manufacturing processes; Network servers; Operations research; Polynomials; Routing; Student members; Vectors;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/9.478231
  • Filename
    478231