DocumentCode
3060308
Title
Tracking recurrent concept drift in streaming data using ensemble classifiers
Author
Ramamurthy, Sasthakumar ; Bhatnagar, Raj
Author_Institution
Univ. of Cincinnati, Cincinnati
fYear
2007
fDate
13-15 Dec. 2007
Firstpage
404
Lastpage
409
Abstract
Streaming data may consist of multiple drifting concepts each having its own underlying data distribution. We present an ensemble learning based approach to handle the data streams having multiple underlying modes. We build a global set of classifiers from sequential data chunks; ensembles are then selected from this global set of classifiers, and new classifiers created if needed, to represent the current concept in the stream. The system is capable of performing any-time classification and to detect concept drift in the stream. In streaming data historic concepts are likely to reappear so we don´t delete any of the historic classifiers. Instead, we judiciously select only pertinent classifiers from the global set while forming the ensemble set for a classification task.
Keywords
data mining; learning (artificial intelligence); pattern classification; any-time classification; data distribution; data stream handling; ensemble classifiers; ensemble learning based approach; recurrent concept drift tracking; stream data mining; Application software; Computer science; Data flow computing; Data mining; Decision trees; Environmental economics; Information filtering; Machine learning; Statistics; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location
Cincinnati, OH
Print_ISBN
978-0-7695-3069-7
Type
conf
DOI
10.1109/ICMLA.2007.80
Filename
4457264
Link To Document