• DocumentCode
    2051688
  • Title

    Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems

  • Author

    Dusparic, Ivana ; Cahill, Vinny

  • Author_Institution
    Lero-The Irish Software Eng. Res. Centre, Trinity Coll. Dublin, Dublin, Ireland
  • fYear
    2009
  • fDate
    14-18 Sept. 2009
  • Firstpage
    20
  • Lastpage
    29
  • Abstract
    Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all agents may be implementing all policies at all times, resulting in agent heterogeneity. As agents share their operating environment, significant dependencies can arise between agents and therefore between policy implementations. To address self-optimization in the presence of agent heterogeneity, policy dependency and the lack of global knowledge that is inherent in large-scale decentralized environments, we propose Distributed W-Learning (DWL). DWL is a reinforcement learning (RL)-based algorithm for collaborative agent-based self-optimization towards multiple policies, which relies only on local interactions and learning. We have evaluated the DWL algorithm in a simulation of a self-organizing urban traffic control (UTC) system and show that using DWL can improve the performance of multiple policies deployed simultaneously, even over corresponding single-policy deployments. For example, in UTC, optimizing simultaneously for cars and public transport vehicles reduces the waiting times of cars to 78% of their waiting times in the best-performing single-policy deployment that optimizes for cars only, while also outperforming the widely-deployed round-robin and saturation balancing traffic controllers that we used as baselines.
  • Keywords
    distributed algorithms; learning (artificial intelligence); multi-agent systems; agent heterogeneity; collaborative agent; distributed W-learning; global knowledge; large-scale agent-based systems; large-scale decentralized environment; multipolicy optimization; operating environment; policy dependency; reinforcement learning; self-optimization; self-organizing systems; self-organizing urban traffic control system; Collaboration; Computer science; Educational institutions; Large-scale systems; Learning; Load management; Software engineering; Statistical distributions; Telecommunication traffic; Vehicles; Multi-policy optimization; reinforcement learning; self-organizing systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Self-Adaptive and Self-Organizing Systems, 2009. SASO '09. Third IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • Print_ISBN
    978-1-4244-4890-6
  • Electronic_ISBN
    978-0-7695-3794-8
  • Type

    conf

  • DOI
    10.1109/SASO.2009.23
  • Filename
    5298481