DocumentCode
2004606
Title
Distributed reinforcement learning for sequential decision making
Author
Rogova, Galina ; Scott, Peter ; Lolett, Carlos
Author_Institution
Center for Multisource Inf. Fusion Encompass Consulting, Honeoye Falls, NY, USA
Volume
2
fYear
2002
fDate
8-11 July 2002
Firstpage
1263
Abstract
The paper addresses a problem of reinforcement learning in a homogeneous non-communicating multi-agent system for sequential decision making. We introduce a particular reinforcement learning model composed of evidential reinforcement neural networks representing agents, a fusion center, and a decision maker. The fusion center combines beliefs in each hypothesis under consideration generated by the agents and produces pignistic probabilities of the hypotheses under consideration. These pignistic probabilities are used by a decision maker in a sequential pignistic probability ratio test to choose one of two actions: "defer decision" or "decide hypothesis k". The test is shaped to encourage early decisions and incorporates a finite decision deadline. Upon each decision, a non-binary reinforcement signal is computed by the environment, and is then fed back to the agents, which utilize it to learn an optimizing belief function. The learning algorithm adapts the "profit sharing strategy" to the sequential decision making setting.
Keywords
distributed decision making; learning (artificial intelligence); sensor fusion; Profit sharing strategy; agents; fusion center; multi-agent system; neural networks; pignistic likelihood ratios test; reinforcement learning; sequential decision making; Computer science; Decision making; Delay; Fusion power generation; Learning; Multiagent systems; Neural networks; Sequential analysis; System testing; Target recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Fusion, 2002. Proceedings of the Fifth International Conference on
Conference_Location
Annapolis, MD, USA
Print_ISBN
0-9721844-1-4
Type
conf
DOI
10.1109/ICIF.2002.1020958
Filename
1020958
Link To Document