DocumentCode :
1579
Title :
SVStream: A Support Vector-Based Algorithm for Clustering Data Streams
Author :
Chang-Dong Wang ; Jian-Huang Lai ; Dong Huang ; Wei-Shi Zheng
Author_Institution :
Sch. of Inf. Sci. & Technol., Sun Yat-sen Univ., Guangzhou, China
Volume :
25
Issue :
6
fYear :
2013
fDate :
Jun-13
Firstpage :
1410
Lastpage :
1424
Abstract :
In this paper, we propose a novel data stream clustering algorithm, termed SVStream, which is based on support vector domain description and support vector clustering. In the proposed algorithm, the data elements of a stream are mapped into a kernel space, and the support vectors are used as the summary information of the historical elements to construct cluster boundaries of arbitrary shape. To adapt to both dramatic and gradual changes, multiple spheres are dynamically maintained, each describing the corresponding data domain presented in the data stream. By allowing for bounded support vectors (BSVs), the proposed SVStream algorithm is capable of identifying overlapping clusters. A BSV decaying mechanism is designed to automatically detect and remove outliers (noise). We perform experiments over synthetic and real data streams, with the overlapping, evolving, and noise situations taken into consideration. Comparison results with state-of-the-art data stream clustering methods demonstrate the effectiveness and efficiency of the proposed method.
Keywords :
pattern clustering; support vector machines; BSV decaying mechanism; SVStream algorithm; automatic outlier detection; automatic outlier removal; bounded support vectors; data stream clustering algorithm; historical elements; kernel space; overlapping cluster identification; summary information; support vector clustering; support vector domain description; support vector-based algorithm; Clustering algorithms; Kernel; Labeling; Merging; Shape; Static VAr compensators; Support vector machines; Data stream clustering; clusters of arbitrary shape; evolving; noise; overlapping; support vector;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.263
Filename :
6109258
Link To Document :
بازگشت