Title :
Certainty and expertness-based credit assignment for cooperative Q-Learning agents with an AND-type task
Author :
Harati, Ahad ; Ahmadabadi, Majid Nili
Author_Institution :
Dept. of Electr. & Comput. Eng., Tehran Univ., Iran
Abstract :
In multiagent reinforcement learning, inter-agent credit assignment is a fundamental problem, since a single scalar reinforcement signal is the only reliable feedback that teams of learning agents receive. This problem is more critical in groups of independent learners with a joint task. In this research, it is assumed that a critic agent receives the environment feedback and assigns a proper credit to each agent using some measures. Three of such measures for a team of cooperative agents with a parallel and AND-type task are introduced. These measures somehow compare the agents´ knowledge. One of these criteria, called normal expertness, is a non-relative measure while two other ones (certainty and relative normal expertness) are relative measure. It is experimentally shown that relative measures work better as they contain more information for the critic agent.
Keywords :
expert systems; feedback; learning (artificial intelligence); multi-agent systems; AND-type task; Q-learning agents; cooperative agents; critic agent; expertness-based credit assignment; feedback; multiagent reinforcement learning; normal expertness; Artificial intelligence; Control systems; Feedback; Intelligent agent; Intelligent control; Intelligent robots; Physics computing; Process control; Reliability engineering; Robot kinematics;
Conference_Titel :
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN :
981-04-7524-1
DOI :
10.1109/ICONIP.2002.1202183