DocumentCode :
2192123
Title :
Clustering spatial data in the presence of obstacles: a density-based approach
Author :
Zaïane, Osmar R. ; Lee, Chi-Hoon
Author_Institution :
Database Lab., Alberta Univ., Edmonton, Alta., Canada
fYear :
2002
fDate :
2002
Firstpage :
214
Lastpage :
223
Abstract :
Clustering spatial data is a well-known problem that has been extensively studied. Grouping similar data in large 2-dimensional spaces to find hidden patterns or meaningful sub-groups has many applications such as satellite imagery, geographic information systems, medical image analysis, marketing, computer visions, etc. Although many methods have been proposed in the literature, very few have considered physical obstacles that may have significant consequences on the effectiveness of the clustering. Taking into account these constraints during the clustering process is costly and the modeling of the constraints is paramount for good performance. In this paper, we investigate the problem of clustering in the presence of constraints such as physical obstacles and introduce a new approach to model these constraints using polygons. We also propose a strategy to prune the search space and reduce the number of polygons to test during clustering. We devise a density-based clustering algorithm, DBCluC, which takes advantage of our constraint modeling to efficiently cluster data objects while considering all physical constraints. The algorithm can detect clusters of arbitrary shape and is insensitive to noise, the input order and the difficulty of constraints. Its average running complexity is O(NlogN) where N is the number of data points.
Keywords :
computational complexity; data mining; pattern clustering; visual databases; DBCluC; average running complexity; constraint modeling; data points; density-based clustering algorithm; hidden pattern finding; input order; large 2D spaces; meaningful sub-group finding; noise; physical obstacles; polygons; search space pruning; similar data grouping; spatial data clustering; Application software; Biomedical imaging; Clustering algorithms; Computer vision; Geographic Information Systems; Image analysis; Noise shaping; Satellites; Shape; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Engineering and Applications Symposium, 2002. Proceedings. International
ISSN :
1098-8068
Print_ISBN :
0-7695-1638-6
Type :
conf
DOI :
10.1109/IDEAS.2002.1029674
Filename :
1029674
Link To Document :
بازگشت