• DocumentCode
    3561942
  • Title

    Branching bandits and Klimov´s problem: achievable region and side constraints

  • Author

    Bertsimas, Dimitris ; Paschalidis, Ioannis Ch ; Tsitsiklis, John N.

  • Author_Institution
    Lab. for Inf. & Decision Syst., MIT, Cambridge, MA, USA
  • Volume
    1
  • fYear
    1994
  • Firstpage
    174
  • Abstract
    Considers the average cost branching bandits problem and its special case known as Klimov´s problem. The authors consider the vector n whose components are the mean number of bandits (or customers) of each type that are present. The authors characterize fully the achievable region, that is, the set of all possible vectors n that can be obtained by considering all possible policies. While the original description of the achievable region involves exponentially many constraints, the authors also develop an alternative description that involves only O(R 2) variables and constraints, where R is the number of bandit types (or customer classes). The authors then consider the problem of minimizing a linear function of n subject to L additional linear constraints on n. The authors show that optimal policies can be obtained by randomizing between L+1 strict priority policies that can be found efficiently (in polynomial time) using linear programming techniques
  • Keywords
    discrete event systems; game theory; linear programming; queueing theory; stochastic systems; Klimov´s problem; achievable region; average cost branching bandits problem; linear function minimisation; linear programming techniques; optimal policies; side constraints; strict priority policies; Control systems; Cost function; Feedback; Laboratories; Linear programming; Polynomials; Stochastic processes; Stochastic systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1994., Proceedings of the 33rd IEEE Conference on
  • Print_ISBN
    0-7803-1968-0
  • Type

    conf

  • DOI
    10.1109/CDC.1994.411026
  • Filename
    411026