• DocumentCode
    589222
  • Title

    Balancing Public Cycle Sharing Schemes Using Independent Learners

  • Author

    Smith, Johan ; Dickens, Luke ; Broda, K.

  • Author_Institution
    Dept. of Comput., Imperial Coll., London, UK
  • Volume
    1
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Firstpage
    168
  • Lastpage
    173
  • Abstract
    This paper concerns the resource management problem arising in public cycle sharing schemes, when some docking stations become empty and remain so while others fill to capacity. To alleviate this, managing companies move bicycles between docking stations in order to maximise the number of satisfied customers while minimising the movement cost. We identify Reinforcement learning (RL) as the most promising technique for finding good movement strategies in these networks, but conventional function-approximation RL methods do not scale well here, due to the quadratic growth in number of actions with network size. We propose the use of cooperating agents, namely Independent Learners, to partition the action space. To overcome the well known issue of coordination in Independent Learners, we combine a novel scheduling approach for asynchronous learning, with a modified Gradient-descent Sarsa(λ) algorithm to manage variable step-sizes. Our method competes with, and scales more favourably than, single-agent RL on a selection of simulated networks.
  • Keywords
    bicycles; customer satisfaction; gradient methods; learning (artificial intelligence); multi-agent systems; network theory (graphs); transportation; asynchronous learning; cooperating agents; customer satisfaction; docking stations; function-approximation RL methods; independent learners; modified gradient-descent Sarsa algorithm; movement cost; movement strategies; network size; public cycle sharing schemes; reinforcement learning; resource management problem; scheduling approach; simulated networks; single-agent RL; step-size management; Bicycles; Convergence; Joints; Learning; Resource management; Schedules; Vectors; cooperative; cycle hire scheme; function approximation; independent learners; large discrete action spaces; multi-agent; reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2012 11th International Conference on
  • Conference_Location
    Boca Raton, FL
  • Print_ISBN
    978-1-4673-4651-1
  • Type

    conf

  • DOI
    10.1109/ICMLA.2012.36
  • Filename
    6406607