• DocumentCode
    1727641
  • Title

    Pairwise Regression with Upper Confidence Bound for Contextual Bandit with Multiple Actions

  • Author

    Ya-Hsuan Chang ; Hsuan-Tien Lin

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2013
  • Firstpage
    19
  • Lastpage
    24
  • Abstract
    The contextual bandit problem is typically used to model online applications such as article recommendation. However, the problem cannot fully meet certain needs of these applications, such as performing multiple actions at the same time. We defined a new Contextual Bandit Problem with Multiple Actions (CBMA), which is an extension of the traditional contextual bandit problem and fits the online applications better. We adapt some existing contextual bandit algorithms for our CBMA problem, and developed the new Pair wise Regression with Upper Confidence Bound (PairUCB) algorithm which addresses the new properties of the new CBMA problem. Experimental results demonstrate that PairUCB significantly outperforms other approaches.
  • Keywords
    game theory; learning (artificial intelligence); regression analysis; CBMA; PairUCB; contextual bandit problem with multiple actions; multiple actions; online applications; pairwise regression with upper confidence bound; Context; Context modeling; Linear regression; Mathematical model; Stochastic processes; Uncertainty; Vectors; contextual bandit; machine learning; upper confidence bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Technologies and Applications of Artificial Intelligence (TAAI), 2013 Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4799-2528-5
  • Type

    conf

  • DOI
    10.1109/TAAI.2013.18
  • Filename
    6783837