Multi-agent persistent monitoring in stochastic environments with temporal logic constraints

Author

Yushan Chen ; Kun Deng ; Belta, Calin

Author_Institution

Boston Univ., Boston, MA, USA

fYear

2012

fDate

10-13 Dec. 2012

Firstpage

2801

Lastpage

2806

Abstract

In this paper, we consider the problem of generating control policies for a team of robots moving in an environment containing elements with probabilistic behaviors. The team is required to achieve an optimal surveillance mission, in which a certain proposition needs to be satisfied infinitely often. The goal is to minimize the average time between satisfying instances of the proposition, while ensuring that the mission is accomplished. By modeling the robots as Transition Systems and the environmental elements as Markov Chains, the problem reduces to finding an optimal control policy satisfying a temporal logic specification on a Markov Decision Process. The existing approaches for this problem are computational intensive and therefore not feasible for a large environment or a large number of robots. To address this issue, we propose an approximate dynamic programming framework. Specifically, we choose a set of basis functions to approximate the optimal cost and find the best parameters for these functions based on the least-square approximation. We develop an approximate policy iteration algorithm to implement our framework. We provide illustrative case studies and evaluate our method through simulations.

Keywords

Markov processes; decision making; dynamic programming; least squares approximations; multi-agent systems; multi-robot systems; optimal control; probability; temporal logic; Markov chains; Markov decision process; approximate dynamic programming framework; control policies; least-square approximation; multiagent persistent monitoring; optimal control policy; optimal surveillance mission; probabilistic behaviors; robots team; stochastic environments; temporal logic constraints; transition systems; Approximation methods; Clocks; Games; Markov processes; Monitoring; Robots; Silicon;

fLanguage

English

Publisher

ieee

Conference_Titel

Decision and Control (CDC), 2012 IEEE 51st Annual Conference on

Conference_Location

Maui, HI

ISSN

0743-1546

Print_ISBN

978-1-4673-2065-8

Electronic_ISBN

0743-1546

Type

conf

DOI

10.1109/CDC.2012.6426280

Filename

6426280