DocumentCode :
2034402
Title :
A classification algorithm for noisy data streams
Author :
Yan Li ; Zhang, Yuhong ; Hu Xuegang ; Li Peipei
Author_Institution :
Sch. of Comput. & Inf., Hefei Univ. of Technol., Hefei, China
Volume :
5
fYear :
2010
fDate :
10-12 Aug. 2010
Firstpage :
2239
Lastpage :
2244
Abstract :
Classification on noisy data streams has recently become one of the most important topics in streaming data mining. In this paper, a Classification algorithm for mining Data Streams based on Mixture Models of C4.5 and NB is proposed called CDSMM. In this algorithm, C4.5 is used as the base classifiers, the hypothesis testing method is introduced for the detection of concept drifts, and a Naïve Bayes classifier is adopted to filter noise. Extensive experiments demonstrate that CDSMM has substantial advantages over similar existing algorithms in the predictive accuracy on noisy data streams with concept drifts.
Keywords :
Bayes methods; data mining; pattern classification; Bayes classifier; C4.5; CDSMM; hypothesis testing method; noisy data stream classification; streaming data mining; Accuracy; Classification algorithms; Data mining; Niobium; Noise; Noise measurement; Prediction algorithms; Classification; Concept Drift; Data Stream; Noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
Type :
conf
DOI :
10.1109/FSKD.2010.5569533
Filename :
5569533
Link To Document :
بازگشت