• DocumentCode
    2470905
  • Title

    Inverse reinforcement learning for decentralized non-cooperative multiagent systems

  • Author

    Reddy, Tummalapalli Sudhamsh ; Gopikrishna, Vamsikrishna ; Zaruba, Gergely ; Huber, Manfred

  • Author_Institution
    Comput. Sci. Dept., Univ. of Texas at Arlington, Arlington, TX, USA
  • fYear
    2012
  • fDate
    14-17 Oct. 2012
  • Firstpage
    1930
  • Lastpage
    1935
  • Abstract
    The objective of inverse reinforcement learning (IRL) is to learn an agent´s reward function based on either the agent´s policies or the observations of the policy. In this paper we address the issue of using inverse reinforcement learning to learn the reward function in a multi agent setting, where the agents can either cooperate or be strictly non-cooperative. The case of cooperataing agents is a subcase of the non-cooperative setting, where the agents collectively try to maximize a common reward function, instead of maximizing their individual reward functions. Here we present an IRL algorithm that considers the case where the policies of the agents are known. We use the framework that was described by Ng and Russell [2001] and extend it for a Multiagent setting. We assume that the agents are rational and follow an optimal policy in the sense of the Nash Equilibrium. These assumptions are very common in Multiagent systems. We show that in the case of known policies we can reduce the Multiagent problem to a distributed solution where the reward function for each agent can be solved independently using a very similar formulation as for the single agent case.
  • Keywords
    game theory; learning (artificial intelligence); multi-agent systems; multivariable systems; IRL algorithm; Nash equilibrium; agent policy; agent reward function learning; common reward function maximization; decentralized noncooperative multiagent systems; inverse reinforcement learning; policy observation; rational agent; Games; Joints; Markov processes; Multiagent systems; Nash equilibrium; Trajectory; Game Theory; General-Sum Stochastic Games; Inverse Reinfocement Learning; Multiagent Systems; Nash equilibrium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-1713-9
  • Electronic_ISBN
    978-1-4673-1712-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2012.6378020
  • Filename
    6378020