Title :
Multi-agent Q-learning and regression trees for automated pricing decisions
Author :
Sridharan, Manu ; Tesauro, Gerald
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
We study the use of the reinforcement learning algorithm Q-learning, with regression-tree function approximation, to learn pricing strategies in a competitive marketplace of economic software agents. Q-learning learns to estimate the long-term expected reward of each state-action pair. In a stationary environment, with a lookup table representing the Q-function, the learning procedure is guaranteed to converge to an optimal policy. Using Q-learning in multi-agent systems, however, presents special challenges: the simultaneous adaptation of multiple agents creates a non-stationary environment for each agent, so there are no theoretical guarantees of convergence or optimality. Moreover, large multi-agent systems may have state spaces too large to represent with lookup tables, necessitating the use of function approximation.
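The update rule behind the abstract is the standard one-step Q-learning rule, Q(s,a) <- Q(s,a) + alpha * [r + gamma * max_a' Q(s',a') - Q(s,a)]. The sketch below is not the paper's market model; it is a minimal illustration of the non-stationarity issue the abstract describes, with two tabular Q-learning pricing agents adapting simultaneously. The price grid, toy demand model, and hyperparameters are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch (illustrative only, not the paper's model): two pricing
# agents adapt simultaneously with tabular Q-learning, so each one's
# environment is non-stationary from its own point of view.
import random

PRICES = [0.2, 0.4, 0.6, 0.8, 1.0]     # assumed discrete action set (posted prices)
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1  # assumed learning hyperparameters

def profit(own_price, rival_price):
    # Toy demand model: the cheaper seller captures most of the market.
    share = 0.8 if own_price < rival_price else (0.5 if own_price == rival_price else 0.2)
    return own_price * share

# State for each agent = the rival's last posted price; Q[i][(state, action)] -> value.
Q = [{}, {}]

def choose(agent, state):
    # Epsilon-greedy action selection over the price grid.
    if random.random() < EPSILON:
        return random.choice(PRICES)
    return max(PRICES, key=lambda a: Q[agent].get((state, a), 0.0))

last = [random.choice(PRICES), random.choice(PRICES)]
for t in range(50_000):
    states = [last[1], last[0]]                       # each agent observes the rival's last price
    acts = [choose(0, states[0]), choose(1, states[1])]
    rewards = [profit(acts[0], acts[1]), profit(acts[1], acts[0])]
    next_states = [acts[1], acts[0]]
    for i in range(2):
        s, a, r, s2 = states[i], acts[i], rewards[i], next_states[i]
        best_next = max(Q[i].get((s2, a2), 0.0) for a2 in PRICES)
        old = Q[i].get((s, a), 0.0)
        # Standard Q-learning update toward the one-step bootstrapped target.
        Q[i][(s, a)] = old + ALPHA * (r + GAMMA * best_next - old)
    last = acts

for i in range(2):
    print("agent", i, "greedy price per observed rival price:",
          {s: max(PRICES, key=lambda a: Q[i].get((s, a), 0.0)) for s in PRICES})
```

In the paper's setting the lookup tables above would be replaced by regression-tree function approximators (e.g., a tree regressor fit to (state, action) -> Q targets) once the state space grows too large to tabulate; that substitution is the subject of the study, not shown here.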
Keywords :
costing; economics; function approximation; learning (artificial intelligence); multi-agent systems; statistical analysis; table lookup; trees (mathematics); automated pricing decisions; convergence; economic software agents; function approximation; long-term expected reward; lookup table; multi-agent Q-learning; optimal policy; regression trees; reinforcement learning; Approximation algorithms; Environmental economics; Function approximation; Learning; Multiagent systems; Pricing; Regression tree analysis; Software agents; Software algorithms; Table lookup;
Conference_Titel :
Proceedings Fourth International Conference on MultiAgent Systems (ICMAS 2000)
Conference_Location :
Boston, MA
Print_ISBN :
0-7695-0625-9
DOI :
10.1109/ICMAS.2000.858518