DocumentCode :
2850455
Title :
Dynamic classifier selection for effective mining from noisy data streams
Author :
Zhu, Xingquan ; Wu, Xindong ; Yang, Ying
Author_Institution :
Dept. of Comput. Sci., Vermont Univ., Burlington, VT, USA
fYear :
2004
fDate :
1-4 Nov. 2004
Firstpage :
305
Lastpage :
312
Abstract :
Mining from data streams has become an important and challenging task for many real-world applications such as credit card fraud protection and sensor networking. One popular solution is to separate stream data into chunks, learn a base classifier from each chunk, and then integrate all base classifiers for effective classification. In this paper, we propose a dynamic classifier selection (DCS) mechanism to integrate base classifiers for effective mining from data streams. The proposed algorithm dynamically selects a single "best" classifier to classify each test instance at run time. Our scheme uses statistical information from attribute values, and uses each attribute to partition the evaluation set into disjoint subsets, followed by a procedure that evaluates the classification accuracy of each base classifier on these subsets. Given a test instance, its attribute values determine the subsets that the similar instances in the evaluation set have constructed, and the classifier with the highest classification accuracy on those subsets is selected to classify the test instance. Experimental results and comparative studies demonstrate the efficiency and efficacy of our method. Such a DCS scheme appears to be promising in mining data streams with dramatic concept drifting or with a significant amount of noise, where the base classifiers are likely conflictive or have low confidence.
Keywords :
data mining; noise; pattern classification; base classifier; dynamic classifier selection; noisy data stream mining; Application software; Computer science; Credit cards; Data mining; Distributed control; Heuristic algorithms; Partitioning algorithms; Protection; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
Type :
conf
DOI :
10.1109/ICDM.2004.10091
Filename :
1410298
Link To Document :
بازگشت