• DocumentCode
    61355
  • Title

    Bayesian Active Learning With Non-Persistent Noise

  • Author

    Naghshvar, Mohammad ; Javidi, Tara ; Chaudhuri, Kamalika

  • Author_Institution
    Corp. R&D, Qualcomm Technol. Inc., San Diego, CA, USA
  • Volume
    61
  • Issue
    7
  • fYear
    2015
  • fDate
    Jul-15
  • Firstpage
    4080
  • Lastpage
    4098
  • Abstract
    We consider the problem of noisy Bayesian active learning where we are given a finite set of functions H, a sample space X , and a label set £. One of the functions in H assigns labels to samples in X . The goal is to identify the function that generates the labels even though the result of a label query on a sample is corrupted by independent noise. More precisely, the objective is to declare one of the functions in H as the true label generating function with high confidence using as a few label queries as possible, by selecting the queries adaptively and in a strategic manner. Previous work in Bayesian active learning considers generalized binary search and its variants for the noisy case, and analyzes the number of queries required by these sampling strategies. In this paper, we show that these schemes are, in general, suboptimal. Instead we propose and analyze an alternative strategy for sample collection. Our sampling strategy is motivated by a connection between Bayesian active learning and active hypothesis testing, and is based on querying the label of a sample, which maximizes the extrinsic Jensen-Shannon divergence at each step. We provide upper and lower bounds on the performance of this sampling strategy, and show that these bounds are better than the previous bounds in the literature.
  • Keywords
    belief networks; learning (artificial intelligence); Jensen-Shannon divergence; finite set; generalized binary search; independent noise corruption; label query; noisy Bayesian active learning; noisy case; nonpersistent noise; sample collection; sample space; strategic manner; Bayes methods; Complexity theory; Education; Noise; Noise measurement; Testing; Upper bound; Bayesian active learning; Extrinsic Jensen–Shannon divergence; extrinsic Jensen???Shannon divergence; generalized binary search; hypothesis testing;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2015.2432101
  • Filename
    7105932