DocumentCode
3265203
Title
Solving Multi-Agent Markov Decision Processes using learning automata
Author
Abtahi, Farnaz ; Meybodi, Mohammad Reza
Author_Institution
Dept. of Comput. Eng. & IT, Amirkabir Univ. of Technol., Tehran
fYear
2008
fDate
26-27 Sept. 2008
Firstpage
1
Lastpage
6
Abstract
Multi-agent Markov decision processes (MMDPs) are widely used for modeling many types of multi-agent systems. In this paper, two new algorithms based on learning automata are proposed for solving MMDPs and finding optimal policies. In the proposed algorithms, the Markov problem is described as a directed graph. The nodes of this graph are the states of the problem, and the directed edges represent the actions that result in a transition from one state to another. Each node in the graph is equipped with a learning automaton whose actions are the outgoing edges of that node. Each agent moves from one node to another and tries to reach the goal state. In each node, the agent chooses its next transition with the help of the learning automaton in that node. The actions taken by the learning automata along the path traveled by the agent are then rewarded or penalized, according to a learning algorithm, based on the cost of the traveled path. In this way, the optimal policy for the agent is gradually reached. Experimental results show that the proposed algorithms perform better than existing learning-automata-based algorithms in terms of cost and the speed of reaching the optimal policy.
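The scheme described in the abstract can be illustrated with a minimal single-agent sketch. It is not the authors' algorithm: the abstract does not specify the learning scheme or the reward criterion, so this sketch assumes a linear reward-inaction update and rewards a path whenever its cost is no worse than the best cost seen so far. The names `LearningAutomaton`, `run_episode`, `train`, and the example `GRAPH` are all hypothetical.

```python
import random


class LearningAutomaton:
    """One automaton per node; its actions are that node's outgoing edges."""

    def __init__(self, n_actions, rate=0.1):
        self.rate = rate
        self.probs = [1.0 / n_actions] * n_actions  # start uniform

    def choose(self, rng):
        # Sample an edge index according to the current action probabilities.
        return rng.choices(range(len(self.probs)), weights=self.probs)[0]

    def reward(self, chosen):
        # Assumed linear reward-inaction update: shift probability mass
        # toward the rewarded action; a penalty leaves probs unchanged.
        self.probs = [
            p + self.rate * (1.0 - p) if j == chosen else p * (1.0 - self.rate)
            for j, p in enumerate(self.probs)
        ]


def run_episode(graph, automata, start, goal, rng, max_steps=50):
    """Walk from start toward goal, letting each node's automaton pick an edge."""
    node, path, cost = start, [], 0.0
    while node != goal and node in automata and len(path) < max_steps:
        i = automata[node].choose(rng)
        next_node, edge_cost = graph[node][i]
        path.append((node, i))
        cost += edge_cost
        node = next_node
    return path, cost, node == goal


def train(graph, start, goal, episodes=2000, seed=0):
    rng = random.Random(seed)
    automata = {n: LearningAutomaton(len(e)) for n, e in graph.items() if e}
    best = float("inf")
    for _ in range(episodes):
        path, cost, reached = run_episode(graph, automata, start, goal, rng)
        # Assumed criterion: reward every automaton on the path when the
        # path cost is no worse than the best cost observed so far.
        if reached and cost <= best:
            best = cost
            for node, i in path:
                automata[node].reward(i)
    return automata, best


# Hypothetical example: node -> list of (next_node, edge_cost).
GRAPH = {
    "A": [("B", 1.0), ("C", 4.0)],
    "B": [("C", 1.0), ("G", 5.0)],
    "C": [("G", 1.0)],
    "G": [],
}
```

Calling `train(GRAPH, "A", "G")` returns the trained automata and the lowest path cost found. Because reward-inaction is an absorbing scheme, this sketch can settle on a suboptimal path; it shows the mechanism only and does not reproduce the paper's results.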
Keywords
Markov processes; directed graphs; learning automata; multi-agent systems; multi-agent Markov decision processes; optimal policies; cost function; space technology; state-space methods
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligent Systems and Informatics, 2008. SISY 2008. 6th International Symposium on
Conference_Location
Subotica
Print_ISBN
978-1-4244-2406-1
Electronic_ISBN
978-1-4244-2407-8
Type
conf
DOI
10.1109/SISY.2008.4664909
Filename
4664909