• DocumentCode
    2016205
  • Title

    An algorithm for Clustering Uncertain Data Streams over Sliding Windows

  • Author

    Huang, Guoyan ; Liang, Dapeng ; Ren, Jiadong ; Hu, Changzhen

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Yanshan Univ., Qinhuangdao, China
  • fYear
    2010
  • fDate
    16-18 Aug. 2010
  • Firstpage
    173
  • Lastpage
    177
  • Abstract
    The existing algorithms for clustering data streams with uncertainty can not analyze recent data in detail. In this paper, we propose SWCUStreams (Clustering Uncertain Data Streams over Sliding Windows) to cluster uncertain data streams, which can obtain the distribution character of recent data by maintaining the Exponential Histogram of Uncertainty Cluster Feature (EHUCF). SWCUStreams adopts the clustering framework of CluStream. In the online micro-cluster phase, Uncertainty Temporal Cluster Feature (UTCF) is defined to describe the uncertainty tuples. Based on the Uncertainty Temporal Cluster Feature (UTCF), Exponential Histogram of Uncertainty Cluster Feature is proposed to store the distribution character of recent data as well as used to dynamically delete expired records included in EHUCF by associating with UTCF. In the offline macro-cluster phase, the final clustering results will be generated according to the statistic information of Exponential Histogram of Uncertainty Cluster Feature (EHUCF) by UK-means algorithm. The experimental results over different types of data sets show that the cluster quality of SWCUStreams is higher.
  • Keywords
    data handling; data mining; pattern clustering; statistical analysis; uncertainty handling; SWCUStreams; UK mean algorithm; data clustering; exponential histogram; sliding window; uncertain data stream; uncertainty temporal cluster feature; Clustering; Sliding windows; Uncertain data streams;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Content, Multimedia Technology and its Applications (IDC), 2010 6th International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4244-7607-7
  • Electronic_ISBN
    978-8-9886-7827-5
  • Type

    conf

  • Filename
    5568709