• DocumentCode
    2485271
  • Title

    An Adaptive Distributed Ensemble Approach to Mine Concept-Drifting Data Streams

  • Author

    Folino, Gianluigi ; Pizzuti, Clara ; Spezzano, Giandomenico

  • Author_Institution
    ICAR-CNR, Rende
  • Volume
    2
  • fYear
    2007
  • fDate
    29-31 Oct. 2007
  • Firstpage
    183
  • Lastpage
    188
  • Abstract
    An adaptive boosting ensemble algorithm for classifying homogeneous distributed data streams is presented. The method builds an ensemble of classifiers by using Genetic Programming (GP) to inductively generate decision trees, each trained on different parts of the distributed training set. The approach adopts a co-evolutionary platform to support a cooperative model of GP. A change detection strategy, based on self-similarity of the ensemble behavior, and measured by its fractal dimension, permits to capture time- evolving trends and patterns in the stream, and to reveal changes in evolving data streams. The approach tracks online ensemble accuracy deviation over time and decides to recompute the ensemble if the deviation has exceeded a pre- specified threshold. This allows the maintenance of an accurate and up-to-date ensemble of classifiers for continuous flows of data with concept drifts. Experimental results on a real life data set show the validity of the approach.
  • Keywords
    data mining; decision trees; genetic algorithms; pattern classification; adaptive boosting ensemble algorithm; adaptive distributed ensemble approach; concept-drifting data streams; decision trees; distributed training set; genetic programming; homogeneous distributed data streams; Boosting; Computer networks; Concurrent computing; Data mining; Decision trees; Fractals; Genetic programming; Large-scale systems; Peer to peer computing; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
  • Conference_Location
    Patras
  • ISSN
    1082-3409
  • Print_ISBN
    978-0-7695-3015-4
  • Type

    conf

  • DOI
    10.1109/ICTAI.2007.51
  • Filename
    4410377