• DocumentCode
    3273381
  • Title

    Simulation Studies of Multi-armed Bandits with Covariates (Invited Paper)

  • Author

    Pavlidis, Nicos G. ; Tasoulis, Dimitris K. ; Hand, David J.

  • fYear
    2008
  • fDate
    1-3 April 2008
  • Firstpage
    493
  • Lastpage
    498
  • Abstract
    We evaluate the performance of a number of action-selection methods on the multi-armed bandit problem with covariates. We resort to simulations because our primary concern is the speed with which the different methods identify the optimal policy, and not their asymptotic behaviour. The experimental results show that the performance of the ε-greedy methods is robust, while the interval estimation strategies achieve the fastest learning of the optimal policy. We propose a metric to quantify the difficulty of a multi-armed bandit problem with covariates and show that there is a trade-off between the satisfaction of the different performance measures.
  • Keywords
    Clinical trials; Computational modeling; Computer simulation; Educational institutions; Fires; Information retrieval; Mathematical model; Probability distribution; Robustness; Routing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Modeling and Simulation, 2008. UKSIM 2008. Tenth International Conference on
  • Conference_Location
    Cambridge, UK
  • Print_ISBN
    0-7695-3114-8
  • Type

    conf

  • DOI
    10.1109/UKSIM.2008.86
  • Filename
    4488981