DocumentCode
2709455
Title
Clustering Uncertain Data Using Voronoi Diagrams
Author
Kao, Ben ; Lee, Sau Dan ; Cheung, David W. ; Ho, Wai-Shing ; Chan, K.F.
Author_Institution
Dept. of Comput. Sci., Univ. of Hong Kong, Hong Kong
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
333
Lastpage
342
Abstract
We study the problem of clustering uncertain objects whose locations are described by probability density functions (pdf). We show that the UK-means algorithm, which generalises the k-means algorithm to handle uncertain objects, is very inefficient. The inefficiency comes from the fact that UK-means computes expected distances (ED) between objects and cluster representatives. For arbitrary pdf´s, expected distances are computed by numerical integrations, which are costly operations. We propose pruning techniques that are based on Voronoi diagrams to reduce the number of expected distance calculation. These techniques are analytically proven to be more effective than the basic bounding-box-based technique previous known in the literature. We conduct experiments to evaluate the effectiveness of our pruning techniques and to show that our techniques significantly outperform previous methods.
Keywords
computational geometry; pattern clustering; probability; UK-mean algorithm; Voronoi diagram; bounding-box-based technique; expected distance calculation; k-means algorithm; probability density function; pruning technique; uncertain data clustering; Bandwidth; Clustering algorithms; Computational efficiency; Computer science; Data mining; Energy conservation; Measurement uncertainty; Mobile communication; Network servers; Probability density function; UK-means; Voronoi diagram; classification; k-means; uncertain data;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location
Pisa
ISSN
1550-4786
Print_ISBN
978-0-7695-3502-9
Type
conf
DOI
10.1109/ICDM.2008.31
Filename
4781128
Link To Document