DocumentCode :
2709455
Title :
Clustering Uncertain Data Using Voronoi Diagrams
Author :
Kao, Ben ; Lee, Sau Dan ; Cheung, David W. ; Ho, Wai-Shing ; Chan, K.F.
Author_Institution :
Dept. of Comput. Sci., Univ. of Hong Kong, Hong Kong
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
333
Lastpage :
342
Abstract :
We study the problem of clustering uncertain objects whose locations are described by probability density functions (pdf). We show that the UK-means algorithm, which generalises the k-means algorithm to handle uncertain objects, is very inefficient. The inefficiency comes from the fact that UK-means computes expected distances (ED) between objects and cluster representatives. For arbitrary pdf´s, expected distances are computed by numerical integrations, which are costly operations. We propose pruning techniques that are based on Voronoi diagrams to reduce the number of expected distance calculation. These techniques are analytically proven to be more effective than the basic bounding-box-based technique previous known in the literature. We conduct experiments to evaluate the effectiveness of our pruning techniques and to show that our techniques significantly outperform previous methods.
Keywords :
computational geometry; pattern clustering; probability; UK-mean algorithm; Voronoi diagram; bounding-box-based technique; expected distance calculation; k-means algorithm; probability density function; pruning technique; uncertain data clustering; Bandwidth; Clustering algorithms; Computational efficiency; Computer science; Data mining; Energy conservation; Measurement uncertainty; Mobile communication; Network servers; Probability density function; UK-means; Voronoi diagram; classification; k-means; uncertain data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.31
Filename :
4781128
Link To Document :
بازگشت