• DocumentCode
    2777301
  • Title

    Density-Based Data Streams Clustering over Sliding Windows

  • Author

    Ren, Jiadong ; Ma, Ruiqing

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Yanshan Univ., Qinhuangdao, China
  • Volume
    5
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    248
  • Lastpage
    252
  • Abstract
    Data stream clustering is an important task in data stream mining. In this paper, we propose SDStream, a new method for performing density-based data streams clustering over sliding windows. SDStream adopts CluStream clustering framework. In the online component, the potential core-micro-cluster and outlier micro-cluster structures are introduced to maintain the potential clusters and outliers. They are stored in the form of exponential histogram of cluster feature (EHCF) in main memory and are maintained by the maintenance of EHCFs. Outdated micro-clusters which need to be deleted are found by the value of t in temporal cluster feature (TCF). In the offline component, the final clusters of arbitrary shape are generated according to all the potential core-micro-clusters maintained online by DBSCAN algorithm. Experimental results show that SDStream which can generate clusters of arbitrary shape has a much higher clustering quality than CluStream which generates spherical clusters.
  • Keywords
    data mining; pattern clustering; CluStream clustering; DBSCAN algorithm; SDStream; core-microcluster structure; data stream mining; density-based data stream clustering; exponential histogram of cluster feature; outlier microcluster structure; sliding window; temporal cluster feature; Cities and towns; Clustering algorithms; Data mining; Data structures; Educational institutions; Electronic mail; Fuzzy systems; Histograms; Partitioning algorithms; Shape; data stream; density-based clustering; sliding windows;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3735-1
  • Type

    conf

  • DOI
    10.1109/FSKD.2009.553
  • Filename
    5360620