• DocumentCode
    3168771
  • Title

    Multi-agent persistent monitoring in stochastic environments with temporal logic constraints

  • Author

    Yushan Chen ; Kun Deng ; Belta, Calin

  • Author_Institution
    Boston Univ., Boston, MA, USA
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    2801
  • Lastpage
    2806
  • Abstract
    In this paper, we consider the problem of generating control policies for a team of robots moving in an environment containing elements with probabilistic behaviors. The team is required to achieve an optimal surveillance mission, in which a certain proposition needs to be satisfied infinitely often. The goal is to minimize the average time between satisfying instances of the proposition, while ensuring that the mission is accomplished. By modeling the robots as Transition Systems and the environmental elements as Markov Chains, the problem reduces to finding an optimal control policy satisfying a temporal logic specification on a Markov Decision Process. The existing approaches for this problem are computational intensive and therefore not feasible for a large environment or a large number of robots. To address this issue, we propose an approximate dynamic programming framework. Specifically, we choose a set of basis functions to approximate the optimal cost and find the best parameters for these functions based on the least-square approximation. We develop an approximate policy iteration algorithm to implement our framework. We provide illustrative case studies and evaluate our method through simulations.
  • Keywords
    Markov processes; decision making; dynamic programming; least squares approximations; multi-agent systems; multi-robot systems; optimal control; probability; temporal logic; Markov chains; Markov decision process; approximate dynamic programming framework; control policies; least-square approximation; multiagent persistent monitoring; optimal control policy; optimal surveillance mission; probabilistic behaviors; robots team; stochastic environments; temporal logic constraints; transition systems; Approximation methods; Clocks; Games; Markov processes; Monitoring; Robots; Silicon;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
  • Conference_Location
    Maui, HI
  • ISSN
    0743-1546
  • Print_ISBN
    978-1-4673-2065-8
  • Electronic_ISBN
    0743-1546
  • Type

    conf

  • DOI
    10.1109/CDC.2012.6426280
  • Filename
    6426280