DocumentCode :
2110342
Title :
A Tensor Factorization Approach to Generalization in Multi-agent Reinforcement Learning
Author :
Bromuri, S.
Author_Institution :
Dept. of Bus. Inf. Syst., Univ. of Appl. Sci. Western Switzerland, Sierre, Switzerland
Volume :
2
fYear :
2012
fDate :
4-7 Dec. 2012
Firstpage :
274
Lastpage :
281
Abstract :
Reinforcement learning (RL) and multi-agent reinforcement learning (MARL) are disciplines concerned with defining automatically the behaviour of an agent, or a set of interacting agents, by means of reward mechanisms coming from the environment. An important research issue in the context of RL and MARL is the definition of approaches to combine the knowledge of multiple learning agents to improve the overall performance of the multi-agent system (MAS). This paper illustrates how to improve RL and MARL algorithms by utilizing results from multi-linear algebra such as tensors and tensor factorizations. In particular, the focus is on showing how to modify existing algorithms from literature to include a tensor factorization step applied to the Q-Tables learned by the individual agents to generalize the knowledge about the actions performed in the environment. The modified algorithms are then evaluated in three RL and MARL scenarios against their unmodified version to show the benefits of the tensor factorization step.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); multi-agent systems; tensors; MARL algorithms; MAS; Q-tables; generalization; interacting agents; modified algorithms; multiagent reinforcement learning; multiagent system; multilinear algebra; multiple learning agents; reward mechanisms; tensor factorization approach; tensor factorizations; tensors; Algorithms; Dynamic Programming; Learning Systems; Software Agents;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
Type :
conf
DOI :
10.1109/WI-IAT.2012.21
Filename :
6511581
Link To Document :
بازگشت