Title :
Distributed reinforcement learning for sequential decision making
Author :
Rogova, Galina ; Scott, Peter ; Lolett, Carlos
Author_Institution :
Center for Multisource Inf. Fusion Encompass Consulting, Honeoye Falls, NY, USA
Abstract :
The paper addresses a problem of reinforcement learning in a homogeneous non-communicating multi-agent system for sequential decision making. We introduce a particular reinforcement learning model composed of evidential reinforcement neural networks representing agents, a fusion center, and a decision maker. The fusion center combines beliefs in each hypothesis under consideration generated by the agents and produces pignistic probabilities of the hypotheses under consideration. These pignistic probabilities are used by a decision maker in a sequential pignistic probability ratio test to choose one of two actions: "defer decision" or "decide hypothesis k". The test is shaped to encourage early decisions and incorporates a finite decision deadline. Upon each decision, a non-binary reinforcement signal is computed by the environment, and is then fed back to the agents, which utilize it to learn an optimizing belief function. The learning algorithm adapts the "profit sharing strategy" to the sequential decision making setting.
Keywords :
distributed decision making; learning (artificial intelligence); sensor fusion; Profit sharing strategy; agents; fusion center; multi-agent system; neural networks; pignistic likelihood ratios test; reinforcement learning; sequential decision making; Computer science; Decision making; Delay; Fusion power generation; Learning; Multiagent systems; Neural networks; Sequential analysis; System testing; Target recognition;
Conference_Titel :
Information Fusion, 2002. Proceedings of the Fifth International Conference on
Conference_Location :
Annapolis, MD, USA
Print_ISBN :
0-9721844-1-4
DOI :
10.1109/ICIF.2002.1020958