مرکز منطقه ای اطلاع رساني علوم و فناوري - Fast learning of approximation policies for coordination in distributed networks

DocumentCode :

133738

Title :

Fast learning of approximation policies for coordination in distributed networks

Author :

Shuping Liu ; Panangadan, Anand ; Raghavendra, Cauligi S. ; Madni, Asad M.

Author_Institution :

Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA

fYear :

2014

fDate :

3-7 Aug. 2014

Firstpage :

285

Lastpage :

290

Abstract :

This paper presents the Average-Max Reinforcement Learning (AMRL) algorithm that can be used to approximate a global policy of a Markov Decision Process (MDP) as a set of local policies that can be executed in a partially observable environment. The local policies are obtained by reinforcement learning and averaging state-action tables under a stochastic process model. This approach overcomes the scalability problem that arises when a large MDP has to be solved exactly. The approach is motivated by the problem of computing coordination policies for correlated but distributed sensors. We demonstrate the performance of this learning scheme on a simulation of a wireless body sensor network. These results show that the performance of the AMRL algorithm is significantly better than a random policy and is close to the optimal policy that can be obtained from solving a global MDP. The results also show that the AMRL algorithm is scalable to networks represented by large state spaces.

Keywords :

Markov processes; approximation theory; body sensor networks; decision making; distributed sensors; learning (artificial intelligence); AMRL algorithm; Markov decision process; approximation policies; average-max reinforcement learning algorithm; averaging state-action tables; coordination policies; distributed networks; distributed sensors; global MDP; learning scheme; local policies; optimal policy; partially observable environment; random policy; scalability problem; state spaces; stochastic process model; wireless body sensor network; Approximation algorithms; Artificial neural networks; Indexes; Microcontrollers; Silicon; Tin; Discrete; Distributed; Intelligent; Reinforcement Learning; Sensor Networks;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

World Automation Congress (WAC), 2014

Conference_Location :

Waikoloa, HI

Type :

conf

DOI :

10.1109/WAC.2014.6935892

Filename :

6935892

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=133738