• DocumentCode
    38494
  • Title

    Predictive Handling of Asynchronous Concept Drifts in Distributed Environments

  • Author

    Hock Hee Ang ; Gopalkrishnan, Vivekanand ; Zliobaite, Indre ; Pechenizkiy, Mykola ; Hoi, Steven C. H.

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    25
  • Issue
    10
  • fYear
    2013
  • fDate
    Oct. 2013
  • Firstpage
    2343
  • Lastpage
    2355
  • Abstract
    In a distributed computing environment, peers collaboratively learn to classify concepts of interest from each other. When external changes happen and their concepts drift, the peers should adapt to avoid increase in misclassification errors. The problem of adaptation becomes more difficult when the changes are asynchronous, i.e., when peers experience drifts at different times. We address this problem by developing an ensemble approach, PINE, that combines reactive adaptation via drift detection, and proactive handling of upcoming changes via early warning and adaptation across the peers. With empirical study on simulated and real-world data sets, we show that PINE handles asynchronous concept drifts better and faster than current state-of-the-art approaches, which have been designed to work in less challenging environments. In addition, PINE is parameter insensitive and incurs less communication cost while achieving better accuracy.
  • Keywords
    distributed processing; learning (artificial intelligence); pattern classification; PINE ensemble approach; communication cost; distributed computing environment; drift detection; empirical analysis; external changes; misclassification errors; predictive asynchronous concept drift handling; proactive change handling; reactive adaptation; real-world data sets; simulated data sets; Accuracy; Adaptation models; Data models; Detectors; Distributed databases; Predictive models; Training; Accuracy; Adaptation models; Classification; Data models; Detectors; Distributed databases; Predictive models; Training; concept drift; distributed systems;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.172
  • Filename
    6294406