• DocumentCode
    15547
  • Title

    Active Learning With Drifting Streaming Data

  • Author

    Zliobaite, Indre ; Bifet, Albert ; Pfahringer, Bernhard ; Holmes, Graham

  • Author_Institution
    Smart Technol. Res. Center, Bournemouth Univ., Poole, UK
  • Volume
    25
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    27
  • Lastpage
    39
  • Abstract
    In learning to classify streaming data, obtaining true labels may require major effort and may incur excessive cost. Active learning focuses on carefully selecting as few labeled instances as possible for learning an accurate predictive model. Streaming data poses additional challenges for active learning, since the data distribution may change over time (concept drift) and models need to adapt. Conventional active learning strategies concentrate on querying the most uncertain instances, which are typically concentrated around the decision boundary. Changes occurring further from the boundary may be missed, and models may fail to adapt. This paper presents a theoretically supported framework for active learning from drifting data streams and develops three active learning strategies for streaming data that explicitly handle concept drift. They are based on uncertainty, dynamic allocation of labeling efforts over time, and randomization of the search space. We empirically demonstrate that these strategies react well to changes that can occur anywhere in the instance space and unexpectedly.
  • Keywords
    data handling; learning (artificial intelligence); pattern classification; active learning strategies; data distribution; dynamic labeling effort allocation; explicit concept drift handling; labeling effort uncertainty; predictive model; search space; streaming data classification; streaming data drifting; Adaptation models; Labeling; Laboratories; Learning systems; Predictive models; Production; Uncertainty; Active learning; concept drift; data streams; user feedback;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2012.2236570
  • Filename
    6414645