• DocumentCode
    2442721
  • Title

    Learning the IPA Market with Individual and Social Rewards

  • Author

    Gomes, Eduardo Rodrigues ; Kowalczyk, Ryszard

  • Author_Institution
    Swinburne Univ. of Technol., Hawthorn
  • fYear
    2007
  • fDate
    2-5 Nov. 2007
  • Firstpage
    328
  • Lastpage
    334
  • Abstract
    Market-based mechanisms offer a promising approach for distributed allocation of resources without centralized control. One of those mechanisms is the iterative price adjustment (IPA). Under standard assumptions, the IPA uses demand functions that do not allow the agents to have preferences over some attributes of the allocation, e.g. different price or resource levels. One of the alternatives to address this limitation is to describe the agents´ preferences using utility functions. In such a scenario, however, there is no unique mapping between the utility functions and a demand function. Gomes & Kowalczyk [10, 9] proposed the use of Reinforcement Learning to let the agents learn the demand functions given the utility functions. Their approach is based on the individual utilities of the agents at the end of the allocation. In this paper, we extend such a work by applying a new reward function, based on the social welfare of the allocation, and by considering more clients in the market. The learning process and the behavior of the agents using both reward functions are investigated through experiments and the results compared.
  • Keywords
    marketing data processing; resource allocation; software agents; iterative price adjustment; market-based mechanisms; resource distributed allocation; reward functions; Australia; Centralized control; Communications technology; Computer architecture; Distributed computing; Grid computing; Intelligent agent; Large-scale systems; Learning; Resource management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Agent Technology, 2007. IAT '07. IEEE/WIC/ACM International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3027-7
  • Type

    conf

  • DOI
    10.1109/IAT.2007.49
  • Filename
    4407306