Title :
Solving Multi-Agent Markov Decision Processes using learning automata
Author :
Abtahi, Farnaz ; Meybodi, Mohammad Reza
Author_Institution :
Dept. of Comput. Eng. & IT, Amirkabir Univ. of Technol., Tehran
Abstract :
Multi-agent Markov decision processes (MMDPs) are widely used for modeling many types of multi-agent systems. In this paper, two new algorithms based on learning automata are proposed for solving MMDPs and finding optimal policies. In the proposed algorithms, Markov problem is described as a directed graph. The nodes of this graph are the states of the problem, and the directed edges represent the actions that result in transition from one state to another. Each node in the graph is equipped with a learning automaton and its actions are the outgoing edges of that node. Each agent moves from one node to another and tries to reach the goal state. In each node, the agent chooses its next transition with help of the learning automaton in that node. The actions taken by learning automata along the path traveled by the agent is then rewarded or penalized based on the cost of the traveled path according to a learning algorithm. This way the optimal policy for the agent will be gradually reached. The results of experiments have shown that our proposed algorithms perform better than the existing learning automata based algorithms in terms of cost and the speed of reaching the optimal policy.
Keywords :
Markov processes; directed graphs; learning automata; multi-agent systems; directed graph; learning automata; multiagent Markov decision processes; multiagent systems; optimal policies; Cost function; Learning automata; Multiagent systems; Space technology; State-space methods; Learning Automata; Multi-Agent Markov Decision Process; Multi-Agent Systems; Optimal Policy;
Conference_Titel :
Intelligent Systems and Informatics, 2008. SISY 2008. 6th International Symposium on
Conference_Location :
Subotica
Print_ISBN :
978-1-4244-2406-1
Electronic_ISBN :
978-1-4244-2407-8
DOI :
10.1109/SISY.2008.4664909