مرکز منطقه ای اطلاع رساني علوم و فناوري - Balancing Public Cycle Sharing Schemes Using Independent Learners

DocumentCode :

589222

Title :

Balancing Public Cycle Sharing Schemes Using Independent Learners

Author :

Smith, Johan ; Dickens, Luke ; Broda, K.

Author_Institution :

Dept. of Comput., Imperial Coll., London, UK

Volume :

fYear :

2012

fDate :

12-15 Dec. 2012

Firstpage :

168

Lastpage :

173

Abstract :

This paper concerns the resource management problem arising in public cycle sharing schemes, when some docking stations become empty and remain so while others fill to capacity. To alleviate this, managing companies move bicycles between docking stations in order to maximise the number of satisfied customers while minimising the movement cost. We identify Reinforcement learning (RL) as the most promising technique for finding good movement strategies in these networks, but conventional function-approximation RL methods do not scale well here, due to the quadratic growth in number of actions with network size. We propose the use of cooperating agents, namely Independent Learners, to partition the action space. To overcome the well known issue of coordination in Independent Learners, we combine a novel scheduling approach for asynchronous learning, with a modified Gradient-descent Sarsa(λ) algorithm to manage variable step-sizes. Our method competes with, and scales more favourably than, single-agent RL on a selection of simulated networks.

Keywords :

bicycles; customer satisfaction; gradient methods; learning (artificial intelligence); multi-agent systems; network theory (graphs); transportation; asynchronous learning; cooperating agents; customer satisfaction; docking stations; function-approximation RL methods; independent learners; modified gradient-descent Sarsa algorithm; movement cost; movement strategies; network size; public cycle sharing schemes; reinforcement learning; resource management problem; scheduling approach; simulated networks; single-agent RL; step-size management; Bicycles; Convergence; Joints; Learning; Resource management; Schedules; Vectors; cooperative; cycle hire scheme; function approximation; independent learners; large discrete action spaces; multi-agent; reinforcement learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Applications (ICMLA), 2012 11th International Conference on

Conference_Location :

Boca Raton, FL

Print_ISBN :

978-1-4673-4651-1

Type :

conf

DOI :

10.1109/ICMLA.2012.36

Filename :

6406607

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=589222