Title :
A Concept Drifting Based Clustering Framework for Data Streams
Author :
Gansen Zhao ; Ziliu Li ; Fujiao Liu ; Yong Tang
Author_Institution :
Sch. of Comput. Sci., South China Normal Univ., Guangzhou, China
Abstract :
It has attracted extensive interests to discover knowledge from data streams generated in real-time. At present, there are some data streams mining frameworks, providing mining solutions for data streams. This paper proposes an on-demand framework (SRAStream) based on the concept drifting detection. SRAStream allows quick clustering with certain accuracy using only limited resource, enabling the real-time mining of very large data stream with acceptable cost. A concept drifting detecting algorithm is proposed, which employs a quick clustering solution to achieve an accurate detection and then perform the related detecting calculation. Experiments have been conducted based on the UCI datasets. The result suggests that the proposed framework does work well and improve the processing speed greatly in data streams clustering.
Keywords :
data mining; pattern clustering; SRAStream; UCI datasets; concept drifting based clustering framework; concept drifting detecting algorithm; data stream clustering; knowledge discovery; on-demand framework; real-time very large data stream mining; Accuracy; Algorithm design and analysis; Clustering algorithms; Context; Data mining; Monitoring; Real-time systems; Big Data; Concept Drifting; Data Streams; Framework; On-Demand Clustering;
Conference_Titel :
Emerging Intelligent Data and Web Technologies (EIDWT), 2013 Fourth International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-2140-9
DOI :
10.1109/EIDWT.2013.26