• DocumentCode
    3265203
  • Title

    Solving Multi-Agent Markov Decision Processes using learning automata

  • Author

    Abtahi, Farnaz ; Meybodi, Mohammad Reza

  • Author_Institution
    Dept. of Comput. Eng. & IT, Amirkabir Univ. of Technol., Tehran
  • fYear
    2008
  • fDate
    26-27 Sept. 2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Multi-agent Markov decision processes (MMDPs) are widely used to model many types of multi-agent systems. In this paper, two new algorithms based on learning automata are proposed for solving MMDPs and finding optimal policies. In the proposed algorithms, the Markov problem is described as a directed graph. The nodes of this graph are the states of the problem, and the directed edges represent the actions that result in a transition from one state to another. Each node in the graph is equipped with a learning automaton whose actions are the outgoing edges of that node. Each agent moves from one node to another and tries to reach the goal state. In each node, the agent chooses its next transition with the help of the learning automaton in that node. The actions taken by the learning automata along the path traveled by the agent are then rewarded or penalized, according to a learning algorithm, based on the cost of the traveled path. In this way, the optimal policy for the agent is gradually reached. Experimental results show that the proposed algorithms perform better than existing learning-automata-based algorithms in terms of cost and the speed of reaching the optimal policy.
  • Keywords
    Markov processes; directed graphs; learning automata; multi-agent systems; directed graph; multiagent Markov decision processes; multiagent systems; optimal policies; Cost function; Space technology; State-space methods; Multi-Agent Markov Decision Process; Optimal Policy
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Systems and Informatics, 2008. SISY 2008. 6th International Symposium on
  • Conference_Location
    Subotica
  • Print_ISBN
    978-1-4244-2406-1
  • Electronic_ISBN
    978-1-4244-2407-8
  • Type

    conf

  • DOI
    10.1109/SISY.2008.4664909
  • Filename
    4664909
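
The graph-based scheme described in the abstract above can be illustrated with a minimal sketch. It is not either of the paper's two algorithms, which the abstract does not specify. It assumes a single agent, deterministic transitions, at least one outgoing edge on every non-goal node, a linear reward-inaction update, and a rule that rewards a path only when its cost is no worse than the best seen so far. All names (`LearningAutomaton`, `run_episode`, `learn`, the example `graph`) are hypothetical.

```python
import random


class LearningAutomaton:
    """One automaton per node; its actions are the node's outgoing edges.

    Assumption: a linear reward-inaction update (no change on penalty).
    """

    def __init__(self, num_actions, rate):
        self.rate = rate
        self.probs = [1.0 / num_actions] * num_actions

    def choose(self, rng):
        # Sample an edge index according to the current action probabilities.
        return rng.choices(range(len(self.probs)), weights=self.probs)[0]

    def reward(self, index):
        # Shift probability mass toward the rewarded action; the sum stays 1.
        self.probs = [
            p + self.rate * (1.0 - p) if i == index else p * (1.0 - self.rate)
            for i, p in enumerate(self.probs)
        ]


def run_episode(graph, automata, start, goal, rng, max_steps=100):
    """Walk from start to goal; return (path, cost), or None if not reached."""
    node, path, cost = start, [], 0.0
    for _ in range(max_steps):
        if node == goal:
            return path, cost
        index = automata[node].choose(rng)
        next_node, edge_cost = graph[node][index]
        path.append((node, index))
        cost += edge_cost
        node = next_node
    return None


def learn(graph, start, goal, episodes=2000, rate=0.01, seed=0):
    """Assumed rule: reward every automaton on a path no costlier than the best so far."""
    rng = random.Random(seed)
    automata = {
        node: LearningAutomaton(len(edges), rate)
        for node, edges in graph.items()
        if edges
    }
    best = float("inf")
    for _ in range(episodes):
        result = run_episode(graph, automata, start, goal, rng)
        if result is None:
            continue
        path, cost = result
        if cost <= best:
            best = cost
            for node, index in path:
                automata[node].reward(index)
    return automata, best


# Hypothetical example: each node maps to a list of (next_node, edge_cost).
graph = {
    "s": [("a", 1.0), ("b", 4.0)],
    "a": [("g", 5.0), ("b", 1.0)],
    "b": [("g", 1.0)],
    "g": [],
}
automata, best = learn(graph, "s", "g")
print(best, automata["s"].probs)
```

A small learning rate keeps early exploration broad, which reduces the chance that the automata commit to a suboptimal path before a cheaper one has been sampled.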