Title :
An ensemble-based approach to fast classification of multi-label data streams
Author :
Kong, Xiangnan ; Yu, Philip S.
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Chicago, Chicago, IL, USA
Abstract :
Network operators are continuously confronted with online events, such as online messages, blog updates, etc. Due to the huge volume of these events and the fast changes of the topics, it is critical to manage them promptly and effectively. There have been many softwares and algorithms developed to conduct automatic classification over these stream data. Conventional approaches focus on single-label scenarios, where each event can only be tagged with one label. However, in many stream data, each event can be tagged with more than one labels. Effective stream classification systems should be able to consider the unique properties of multi-label stream data, such as large data volumes, label correlations and concept drifts. To address these challenges, in this paper, we propose an efficient and effective method for multi-label stream classification based on an ensemble of fading random trees. The proposed model can efficiently process high-speed multi-label stream data with concept drifts. Empirical studies on real-world tasks demonstrate that our method can maintain a high accuracy in multi-label stream classification, while providing a very efficient solution to the task.
Keywords :
classification; trees (mathematics); automatic classification; blog updates; concept drifts; ensemble-based approach; fading random trees; fast classification; highspeed multilabel stream data; label correlations; large data volumes; multilabel data streams; multilabel stream classification; online messages; single-label scenarios; stream classification systems; Data stream; data mining; multi-label classification; random tree;
Conference_Titel :
Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom), 2011 7th International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4673-0683-6
Electronic_ISBN :
978-1-936968-32-9