• DocumentCode
    2709455
  • Title

    Clustering Uncertain Data Using Voronoi Diagrams

  • Author

    Kao, Ben ; Lee, Sau Dan ; Cheung, David W. ; Ho, Wai-Shing ; Chan, K.F.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Hong Kong, Hong Kong
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    333
  • Lastpage
    342
  • Abstract
    We study the problem of clustering uncertain objects whose locations are described by probability density functions (pdf). We show that the UK-means algorithm, which generalises the k-means algorithm to handle uncertain objects, is very inefficient. The inefficiency comes from the fact that UK-means computes expected distances (ED) between objects and cluster representatives. For arbitrary pdf´s, expected distances are computed by numerical integrations, which are costly operations. We propose pruning techniques that are based on Voronoi diagrams to reduce the number of expected distance calculation. These techniques are analytically proven to be more effective than the basic bounding-box-based technique previous known in the literature. We conduct experiments to evaluate the effectiveness of our pruning techniques and to show that our techniques significantly outperform previous methods.
  • Keywords
    computational geometry; pattern clustering; probability; UK-mean algorithm; Voronoi diagram; bounding-box-based technique; expected distance calculation; k-means algorithm; probability density function; pruning technique; uncertain data clustering; Bandwidth; Clustering algorithms; Computational efficiency; Computer science; Data mining; Energy conservation; Measurement uncertainty; Mobile communication; Network servers; Probability density function; UK-means; Voronoi diagram; classification; k-means; uncertain data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
  • Conference_Location
    Pisa
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3502-9
  • Type

    conf

  • DOI
    10.1109/ICDM.2008.31
  • Filename
    4781128