DocumentCode
183006
Title
Ensemble based data stream mining with recalling and forgetting mechanisms
Author
Yanhuang Jiang ; Qiangli Zhao ; Yutong Lu
Author_Institution
State Key Lab. of High Performance Comput., Nat. Univ. of Defense Technol., Changsha, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
430
Lastpage
435
Abstract
Using ensemble of classifiers on sequential chunks of training instances is a popular strategy for data stream mining. Aiming at the limitations of the existing approaches, we introduce recalling and forgetting mechanisms into ensemble based data stream mining, and put forward a new algorithm MAE (Memorizing based Adaptive Ensemble) to mine chunk-based data streams with concept drifts. Ensemble pruning is used as a recalling mechanism to select useful component classifiers for each incoming data chunk. Ebbinghaus forgetting curve is adopted as a forgetting mechanism to evaluate and replace the component classifiers in the memory repository. Experiments have been performed on datasets with different types of concept drifts. Compared with traditional ensemble approaches, the results show that MAE is a good algorithm with high and stable accuracy, less predicting time and moderate training time.
Keywords
data mining; pattern classification; Ebbinghaus forgetting curve; MAE algorithm; chunk-based data streams; ensemble based data stream mining; ensemble pruning; forgetting mechanisms; memorizing based adaptive ensemble algorithm; memory repository; recalling mechanisms; sequential chunks; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Prediction algorithms; Training; Ebbinghaus forgetting curve; data stream mining; ensemble pruning; recalling mechanism;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5147-5
Type
conf
DOI
10.1109/FSKD.2014.6980873
Filename
6980873
Link To Document