• DocumentCode
    1850370
  • Title

    Fuzzy policy gradient reinforcement learning for leader-follower systems

  • Author

    Gu, Dongbing ; Yang, Erfu

  • Author_Institution
    Dept. of Comput. Sci., Essex Univ., Colchester, UK
  • Volume
    3
  • fYear
    2005
  • fDate
    2005
  • Firstpage
    1557
  • Abstract
    This paper presents a policy gradient multi-agent reinforcement learning algorithm for leader-follower systems. In this algorithm, cooperative dynamics of the leader-follower control is modelled as an incentive Stackelberg game. A linear incentive mechanism is used to connect the leader and follower policies. Policy gradient reinforcement learning explicitly explores policy parameter space to search the optimal policy. Fuzzy logic controllers are used as the policy. The parameters of fuzzy logic controllers can be improved by this policy gradient algorithm.
  • Keywords
    control engineering computing; fuzzy control; game theory; learning (artificial intelligence); multi-agent systems; cooperative dynamics; fuzzy logic controllers; fuzzy policy gradient reinforcement learning; incentive Stackelberg game; leader-follower systems; linear incentive mechanism; Convergence; Function approximation; Fuzzy logic; Fuzzy systems; Game theory; Heuristic algorithms; Learning; Minimax techniques; Multiagent systems; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mechatronics and Automation, 2005 IEEE International Conference
  • Conference_Location
    Niagara Falls, Ont., Canada
  • Print_ISBN
    0-7803-9044-X
  • Type

    conf

  • DOI
    10.1109/ICMA.2005.1626787
  • Filename
    1626787