Title :
A Distributed Actor-Critic Algorithm and Applications to Mobile Sensor Network Coordination Problems
Author :
Pennesi, Paris ; Paschalidis, Ioannis Ch
Author_Institution :
RBS Global Banking & Markets, London, UK
Abstract :
We introduce and establish the convergence of a distributed actor-critic method that orchestrates the coordination of multiple agents solving a general class of Markov decision problems. The method leverages an existing centralized single-agent actor-critic algorithm and uses a consensus-like algorithm for updating the agents' policy parameters. As an application, and to validate our approach, we consider a reward collection problem as an instance of a multi-agent coordination problem in a partially known environment that is subject to dynamical changes and communication constraints.
Keywords :
Markov processes; convergence; decision theory; distributed algorithms; mobile communication; multi-agent systems; wireless sensor networks; Markov decision problem; agent policy parameters; consensus like algorithm; distributed actor critic algorithm; mobile sensor network coordination problems; multi agent coordination; reward collection problem; Adaptive control; Adaptive signal processing; Algorithm design and analysis; Banking; Convergence; Dynamic programming; Neural networks; Optimization methods; Prediction algorithms; Robot kinematics; Robot sensing systems; Signal processing algorithms; Space exploration; Stochastic processes; Stochastic systems; Systems engineering and theory; US Department of Energy; Actor-critic methods; Markov decision processes (MDP); consensus; multi-agent coordination; sensor networks;
Journal_Title :
Automatic Control, IEEE Transactions on
DOI :
10.1109/TAC.2009.2037462