• DocumentCode
    2454174
  • Title

    Multi-Agent Inverse Reinforcement Learning

  • Author

    Natarajan, Sriraam ; Kunapuli, Gautam ; Judah, Kshitij ; Tadepalli, Prasad ; Kersting, Kristian ; Shavlik, Jude

  • fYear
    2010
  • fDate
    12-14 Dec. 2010
  • Firstpage
    395
  • Lastpage
    400
  • Abstract
    Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship learning. We introduce the problem of multi-agent inverse reinforcement learning, where reward functions of multiple agents are learned by observing their uncoordinated behavior. A centralized controller then learns to coordinate their behavior by optimizing a weighted sum of reward functions of all the agents. We evaluate our approach on a traffic-routing domain, in which a controller coordinates actions of multiple traffic signals to regulate traffic density. We show that the learner is not only able to match but even significantly outperform the expert.
  • Keywords
    learning (artificial intelligence); multi-agent systems; traffic engineering computing; centralized controller; inverse reinforcement learning; multiagent; reward function; traffic density; traffic-routing domain; Aerospace electronics; Equations; Learning; Mathematical model; Optimization; Road transportation; Trajectory; Learning; Reinforcement Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-9211-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2010.65
  • Filename
    5708862