Title :
An Ensemble of Classifiers Algorithm Based on GA for Handling Concept-Drifting Data Streams
Author :
Jinghua Guan ; Wu Guo ; Heng Chen ; OuJun Lou
Author_Institution :
Sch. of Software, Dalian Univ. of Foreign Languages Dalian, Dalian, China
Abstract :
In data streams, concepts are often not stable but change with time. In this paper, we propose a selective integration algorithm DGASEN (Dynamic GA based Selected ENsemble) for handling concept-drifting data streams. This algorithm selects a near optimal subset of base classifiers based on GA algorithm and the predictive accuracy of each base classifier on validation dataset. This paper chooses SEA(with simulating abrupt concept drift) and Hyperplane (with gradual concept drift) as experimental data sets. The experimental results demonstrate that selective integration of classifiers can be significantly better than majority voting and weighted voting, which are currently the most commonly used integration techniques for handling concept drift in ensemble learning. The experimental results show that DGASEN algorithm improves the classification accuracy of integrated algorithm in handling concept-drifting data streams.
Keywords :
data mining; genetic algorithms; learning (artificial intelligence); pattern classification; DGASEN algorithm; classifiers algorithm; concept-drifting data stream; dynamic genetic algorithm; ensemble learning; hyperplane; selected ensemble; Accuracy; Classification algorithms; Data mining; Educational institutions; Heuristic algorithms; Knowledge discovery; Prediction algorithms; Concept drift; GA; Naive Bayes; Selective ensemble;
Conference_Titel :
Parallel Architectures, Algorithms and Programming (PAAP), 2014 Sixth International Symposium on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-3844-5
DOI :
10.1109/PAAP.2014.24