DocumentCode :
589222
Title :
Balancing Public Cycle Sharing Schemes Using Independent Learners
Author :
Smith, Johan ; Dickens, Luke ; Broda, K.
Author_Institution :
Dept. of Comput., Imperial Coll., London, UK
Volume :
1
fYear :
2012
fDate :
12-15 Dec. 2012
Firstpage :
168
Lastpage :
173
Abstract :
This paper concerns the resource management problem arising in public cycle sharing schemes, when some docking stations become empty and remain so while others fill to capacity. To alleviate this, managing companies move bicycles between docking stations in order to maximise the number of satisfied customers while minimising the movement cost. We identify Reinforcement learning (RL) as the most promising technique for finding good movement strategies in these networks, but conventional function-approximation RL methods do not scale well here, due to the quadratic growth in number of actions with network size. We propose the use of cooperating agents, namely Independent Learners, to partition the action space. To overcome the well known issue of coordination in Independent Learners, we combine a novel scheduling approach for asynchronous learning, with a modified Gradient-descent Sarsa(λ) algorithm to manage variable step-sizes. Our method competes with, and scales more favourably than, single-agent RL on a selection of simulated networks.
Keywords :
bicycles; customer satisfaction; gradient methods; learning (artificial intelligence); multi-agent systems; network theory (graphs); transportation; asynchronous learning; cooperating agents; customer satisfaction; docking stations; function-approximation RL methods; independent learners; modified gradient-descent Sarsa algorithm; movement cost; movement strategies; network size; public cycle sharing schemes; reinforcement learning; resource management problem; scheduling approach; simulated networks; single-agent RL; step-size management; Bicycles; Convergence; Joints; Learning; Resource management; Schedules; Vectors; cooperative; cycle hire scheme; function approximation; independent learners; large discrete action spaces; multi-agent; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
Type :
conf
DOI :
10.1109/ICMLA.2012.36
Filename :
6406607
Link To Document :
بازگشت