DocumentCode :
3726505
Title :
Naïve Bayes Classification Ensembles to Support Modeling Decisions in Data Stream Mining
Author :
Patricia E.N. Lutu
Author_Institution :
Dept. of Comput. Sci., Univ. of Pretoria, Pretoria, South Africa
fYear :
2015
Firstpage :
335
Lastpage :
340
Abstract :
Data stream mining is the process of applying data mining methods to a data stream in real-time in order to create descriptive or predictive models. Due to the dynamic nature of data streams, new classes may emerge as a data stream evolves, and the concept being modelled may change with time. This gives rise to the need to continuously make revisions to the predictive model. Revising the predictive model requires that labelled training data should be available. Manual labelling of training data may not be able to cope with the speed at which data needs to be labelled. This paper proposes a predictive modeling framework which supports two of the common decisions that need to be made in stream mining. The framework consists of two components: an online component and an offline component. The online component uses Naïve Bayes ensemble base models to make predictions for newly arrived data stream instances. The offline component consists of algorithms to combine base model predictions, determine the reliability of the ensemble predictions, select training data for new base models, create new base models, and determine whether the current online base models need to be replaced.
Keywords :
"Predictive models","Data models","Data mining","Training data","Prediction algorithms","Frequency measurement","Reliability"
Publisher :
ieee
Conference_Titel :
Computational Intelligence, 2015 IEEE Symposium Series on
Print_ISBN :
978-1-4799-7560-0
Type :
conf
DOI :
10.1109/SSCI.2015.57
Filename :
7376630
Link To Document :
بازگشت