• DocumentCode
    2766155
  • Title

    Learning a Rendezvous Task with Dynamic Joint Action Perception

  • Author

    Fulda, Nancy ; Ventura, Dan

  • Author_Institution
    Brigham Young Univ., Provo
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    235
  • Lastpage
    240
  • Abstract
    Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environments where agents must coordinate with each other to receive rewards and where failed coordination attempts are penalized. This paper studies the effectiveness of the dynamic joint action perception (DJAP) algorithm on a grid-world rendezvous task with this characteristic. The effects of learning rate, exploration strategy, and training time on algorithm effectiveness are discussed. An analysis of the types of tasks for which DJAP learning is appropriate is also presented.
  • Keywords
    learning (artificial intelligence); multi-agent systems; dynamic joint action perception; exploration strategy; grid-world; reinforcement learning agents; rendezvous task; Algorithm design and analysis; Computer science; Equations; Learning; Performance analysis; Scalability; Shadow mapping; State estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2006. IJCNN '06. International Joint Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    0-7803-9490-9
  • Type

    conf

  • DOI
    10.1109/IJCNN.2006.246686
  • Filename
    1716097