• DocumentCode
    2192123
  • Title

    Clustering spatial data in the presence of obstacles: a density-based approach

  • Author

    Zaïane, Osmar R. ; Lee, Chi-Hoon

  • Author_Institution
    Database Lab., Alberta Univ., Edmonton, Alta., Canada
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    214
  • Lastpage
    223
  • Abstract
    Clustering spatial data is a well-known problem that has been extensively studied. Grouping similar data in large 2-dimensional spaces to find hidden patterns or meaningful sub-groups has many applications such as satellite imagery, geographic information systems, medical image analysis, marketing, computer visions, etc. Although many methods have been proposed in the literature, very few have considered physical obstacles that may have significant consequences on the effectiveness of the clustering. Taking into account these constraints during the clustering process is costly and the modeling of the constraints is paramount for good performance. In this paper, we investigate the problem of clustering in the presence of constraints such as physical obstacles and introduce a new approach to model these constraints using polygons. We also propose a strategy to prune the search space and reduce the number of polygons to test during clustering. We devise a density-based clustering algorithm, DBCluC, which takes advantage of our constraint modeling to efficiently cluster data objects while considering all physical constraints. The algorithm can detect clusters of arbitrary shape and is insensitive to noise, the input order and the difficulty of constraints. Its average running complexity is O(NlogN) where N is the number of data points.
  • Keywords
    computational complexity; data mining; pattern clustering; visual databases; DBCluC; average running complexity; constraint modeling; data points; density-based clustering algorithm; hidden pattern finding; input order; large 2D spaces; meaningful sub-group finding; noise; physical obstacles; polygons; search space pruning; similar data grouping; spatial data clustering; Application software; Biomedical imaging; Clustering algorithms; Computer vision; Geographic Information Systems; Image analysis; Noise shaping; Satellites; Shape; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Engineering and Applications Symposium, 2002. Proceedings. International
  • ISSN
    1098-8068
  • Print_ISBN
    0-7695-1638-6
  • Type

    conf

  • DOI
    10.1109/IDEAS.2002.1029674
  • Filename
    1029674