DocumentCode :
17521
Title :
Meta Expert Learning and Efficient Pruning for Evolving Data Streams
Author :
Azarafrooz, Mahdi ; Daneshmand, Mahmoud
Author_Institution :
Howe Sch. of Technol. Manage., Stevens Inst. of Technol., Hoboken, NJ, USA
Volume :
2
Issue :
4
fYear :
2015
fDate :
Aug. 2015
Firstpage :
268
Lastpage :
273
Abstract :
Researchers have proposed several ensemble methods for the data stream environments including online bagging and boosting. These studies show that bagging methods perform better than boosting methods although the opposite is known to be true in the batch setting environments. The reason behind the weaker performance of boosting methods in the streaming environments is not clear. We have taken advantage of the algorithmic procedure of meta expert learnings for the sake of our study. The meta expert learning differs from the classic expert learning methods in that each expert starts to predict from a different point in the history. Moreover, maintaining a collection of base learners follows an algorithmic procedure. The focus of this paper is on studying the pruning function for maintaining the appropriate set of experts rather than proposing a competitive algorithm for selecting the experts. It shows how a well-structured pruning method leads to a better prediction accuracy without necessary higher memory consumption. Next, it is shown how pruning the set of base learners in the meta expert learning (in order to avoid memory exhaustion) affects the prediction accuracy for different types of drifts. In the case of time-locality drifts, the prediction accuracy is highly tied to the mathematical structure of the pruning algorithms. This observation may explain the main reason behind the weak performance of previously studied boosting methods in the streaming environments. It shows that the boosting algorithms should be designed with respect to the suitable notion of the regret metrics.
Keywords :
expert systems; learning (artificial intelligence); base learners; competitive algorithm; data stream environments; ensemble methods; expert selection; mathematical structure; meta expert learning; online bagging method; online boosting method; pruning function; regret metrics; time-locality drifts; Accuracy; Algorithm design and analysis; Bagging; Boosting; Generators; Heuristic algorithms; Prediction algorithms;
fLanguage :
English
Journal_Title :
Internet of Things Journal, IEEE
Publisher :
ieee
ISSN :
2327-4662
Type :
jour
DOI :
10.1109/JIOT.2015.2420689
Filename :
7081354
Link To Document :
بازگشت