DocumentCode :
133738
Title :
Fast learning of approximation policies for coordination in distributed networks
Author :
Shuping Liu ; Panangadan, Anand ; Raghavendra, Cauligi S. ; Madni, Asad M.
Author_Institution :
Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
fYear :
2014
fDate :
3-7 Aug. 2014
Firstpage :
285
Lastpage :
290
Abstract :
This paper presents the Average-Max Reinforcement Learning (AMRL) algorithm that can be used to approximate a global policy of a Markov Decision Process (MDP) as a set of local policies that can be executed in a partially observable environment. The local policies are obtained by reinforcement learning and averaging state-action tables under a stochastic process model. This approach overcomes the scalability problem that arises when a large MDP has to be solved exactly. The approach is motivated by the problem of computing coordination policies for correlated but distributed sensors. We demonstrate the performance of this learning scheme on a simulation of a wireless body sensor network. These results show that the performance of the AMRL algorithm is significantly better than a random policy and is close to the optimal policy that can be obtained from solving a global MDP. The results also show that the AMRL algorithm is scalable to networks represented by large state spaces.
Keywords :
Markov processes; approximation theory; body sensor networks; decision making; distributed sensors; learning (artificial intelligence); AMRL algorithm; Markov decision process; approximation policies; average-max reinforcement learning algorithm; averaging state-action tables; coordination policies; distributed networks; distributed sensors; global MDP; learning scheme; local policies; optimal policy; partially observable environment; random policy; scalability problem; state spaces; stochastic process model; wireless body sensor network; Approximation algorithms; Artificial neural networks; Indexes; Microcontrollers; Silicon; Tin; Discrete; Distributed; Intelligent; Reinforcement Learning; Sensor Networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
World Automation Congress (WAC), 2014
Conference_Location :
Waikoloa, HI
Type :
conf
DOI :
10.1109/WAC.2014.6935892
Filename :
6935892
Link To Document :
بازگشت