• DocumentCode
    3060308
  • Title

    Tracking recurrent concept drift in streaming data using ensemble classifiers

  • Author

    Ramamurthy, Sasthakumar ; Bhatnagar, Raj

  • Author_Institution
    Univ. of Cincinnati, Cincinnati
  • fYear
    2007
  • fDate
    13-15 Dec. 2007
  • Firstpage
    404
  • Lastpage
    409
  • Abstract
    Streaming data may consist of multiple drifting concepts each having its own underlying data distribution. We present an ensemble learning based approach to handle the data streams having multiple underlying modes. We build a global set of classifiers from sequential data chunks; ensembles are then selected from this global set of classifiers, and new classifiers created if needed, to represent the current concept in the stream. The system is capable of performing any-time classification and to detect concept drift in the stream. In streaming data historic concepts are likely to reappear so we don´t delete any of the historic classifiers. Instead, we judiciously select only pertinent classifiers from the global set while forming the ensemble set for a classification task.
  • Keywords
    data mining; learning (artificial intelligence); pattern classification; any-time classification; data distribution; data stream handling; ensemble classifiers; ensemble learning based approach; recurrent concept drift tracking; stream data mining; Application software; Computer science; Data flow computing; Data mining; Decision trees; Environmental economics; Information filtering; Machine learning; Statistics; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
  • Conference_Location
    Cincinnati, OH
  • Print_ISBN
    978-0-7695-3069-7
  • Type

    conf

  • DOI
    10.1109/ICMLA.2007.80
  • Filename
    4457264