• DocumentCode
    2935431
  • Title

    Game Theoretic Control for Robot Teams

  • Author

    Emery-Montemerlo, Rosemary ; Gordon, Geoff ; Schneider, Jeff ; Thrun, Sebastian

  • Author_Institution
    School of Computer Science Carnegie Mellon University Pittsburgh PA 15312; remery@cs.cmu.edu
  • fYear
    2005
  • fDate
    18-22 April 2005
  • Firstpage
    1163
  • Lastpage
    1169
  • Abstract
    In the real world, noisy sensors and limited communication make it difficult for robot teams to coordinate in tightly coupled tasks. Team members cannot simply apply single-robot solution techniques for partially observable problems in parallel because they do not take into account the recursive effect that reasoning about the beliefs of others has on policy generation. Instead, we must turn to a game theoretic approach to model the problem correctly. Partially observable stochastic games (POSGs) provide a solution model for decentralized robot teams, however, this model quickly becomes intractable. In previous work we presented an algorithm for lookahead search in POSGs. Here we present an extension which reduces computation during lookahead by clustering similar observation histories together. We show that by clustering histories which have similar profiles of predicted reward, we can greatly reduce the computation time required to solve a POSG while maintaining a good approximation to the optimal policy. We demonstrate the power of the clustering algorithm in a real-time robot controller as well as for a simple benchmark problem.
  • Keywords
    Decentralized control; Multi-Robot Coordination; Partially Observable Domains; Planning; Bandwidth; Clustering algorithms; Game theory; History; Mobile communication; Orbital robotics; Robot control; Robot kinematics; Robot sensing systems; Stochastic processes; Decentralized control; Multi-Robot Coordination; Partially Observable Domains; Planning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation, 2005. ICRA 2005. Proceedings of the 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-8914-X
  • Type

    conf

  • DOI
    10.1109/ROBOT.2005.1570273
  • Filename
    1570273