DocumentCode
3606897
Title
Ensemble of distributed learners for online classification of dynamic data streams
Author
Canzian, Luca ; Yu Zhang ; Van der Schaar, Mihaela
Author_Institution
Qascom, Bassano del Grappa, Italy
Volume
1
Issue
3
fYear
2015
Firstpage
180
Lastpage
194
Abstract
We present a distributed online learning scheme to classify data captured from distributed and dynamic data sources. Our scheme consists of multiple distributed local learners, which analyze different streams of data that are correlated to a common event that needs to be classified. Each learner uses a local classifier to make a local prediction. The local predictions are then collected by each learner and combined using a weighted majority rule to output the final prediction. We propose a novel online ensemble learning algorithm to update the aggregation rule in order to adapt to the underlying data dynamics. We rigorously determine an upper bound for the worst-case mis-classification probability of our algorithm, which tends asymptotically to 0 if the misclassification probability of the best (unknown) static aggregation rule is 0. Then we extend our algorithm to address challenges specific to the distributed implementation and prove new bounds that apply to these settings. Finally, we test our scheme by performing an evaluation study on several data sets.
Keywords
data handling; distributed processing; learning (artificial intelligence); pattern classification; data dynamics; distributed learners; distributed online learning; dynamic data sources; dynamic data streams; multiple distributed local learners; online classification; online ensemble learning algorithm; Distributed databases; Heuristic algorithms; Indexes; Information processing; Prediction algorithms; Pulse width modulation; Training; Big data; Online learning; big data; classification; concept drift; distributed learning; dynamic streams; ensemble of classifiers; machine learning; online learning;
fLanguage
English
Journal_Title
Signal and Information Processing over Networks, IEEE Transactions on
Publisher
ieee
ISSN
2373-776X
Type
jour
DOI
10.1109/TSIPN.2015.2470125
Filename
7274771
Link To Document