• DocumentCode
    3168557
  • Title

    Conditions for learning in generalized tandem networks

  • Author

    Drakopoulos, Kimon ; Ozdaglar, Asuman ; Tsitsiklis, John N.

  • Author_Institution
    Lab. of Inf. & Decision Syst., Massachusetts Inst. of Technol., Cambridge, MA, USA
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    7437
  • Lastpage
    7444
  • Abstract
    We consider an infinite collection of agents who make decisions, sequentially, about an unknown underlying binary state of the world. Each agent, prior to making a decision, receives an independent private signal whose distribution depends on the state of the world. Moreover, each agent also observes the decisions of its last K immediate predecessors. We study conditions under which the agent decisions converge to the correct value of the underlying state. We focus on the case where the private signals have bounded information content and investigate whether learning is possible, that is, whether there exist decision rules for the different agents that result in the convergence of their sequence of individual decisions to the correct state of the world. We first consider learning in the almost sure sense and show that it is impossible, for any value of K. We then explore the possibility of convergence in probability of the decisions to the correct state. Here, a distinction arises: if K = 1, learning in probability is impossible under any decision rule, while for K ≥ 2, we design a decision rule that achieves it.
  • Keywords
    decision making; learning (artificial intelligence); multi-agent systems; probability; agent; decision making; generalized tandem network; learning; probability; underlying binary state; Bayesian methods; Convergence; Markov processes; Medical treatment; Random variables; Sensors; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
  • Conference_Location
    Maui, HI
  • ISSN
    0743-1546
  • Print_ISBN
    978-1-4673-2065-8
  • Electronic_ISBN
    0743-1546
  • Type

    conf

  • DOI
    10.1109/CDC.2012.6426271
  • Filename
    6426271