• DocumentCode
    1579
  • Title

    SVStream: A Support Vector-Based Algorithm for Clustering Data Streams

  • Author

    Chang-Dong Wang ; Jian-Huang Lai ; Dong Huang ; Wei-Shi Zheng

  • Author_Institution
    Sch. of Inf. Sci. & Technol., Sun Yat-sen Univ., Guangzhou, China
  • Volume
    25
  • Issue
    6
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    1410
  • Lastpage
    1424
  • Abstract
    In this paper, we propose a novel data stream clustering algorithm, termed SVStream, which is based on support vector domain description and support vector clustering. In the proposed algorithm, the data elements of a stream are mapped into a kernel space, and the support vectors are used as the summary information of the historical elements to construct cluster boundaries of arbitrary shape. To adapt to both dramatic and gradual changes, multiple spheres are dynamically maintained, each describing the corresponding data domain presented in the data stream. By allowing for bounded support vectors (BSVs), the proposed SVStream algorithm is capable of identifying overlapping clusters. A BSV decaying mechanism is designed to automatically detect and remove outliers (noise). We perform experiments over synthetic and real data streams, with the overlapping, evolving, and noise situations taken into consideration. Comparison results with state-of-the-art data stream clustering methods demonstrate the effectiveness and efficiency of the proposed method.
  • Keywords
    pattern clustering; support vector machines; BSV decaying mechanism; SVStream algorithm; automatic outlier detection; automatic outlier removal; bounded support vectors; data stream clustering algorithm; historical elements; kernel space; overlapping cluster identification; summary information; support vector clustering; support vector domain description; support vector-based algorithm; Clustering algorithms; Kernel; Labeling; Merging; Shape; Static VAr compensators; Support vector machines; Data stream clustering; clusters of arbitrary shape; evolving; noise; overlapping; support vector;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2011.263
  • Filename
    6109258