• DocumentCode
    2833714
  • Title

    An improved sampling-based DBSCAN for large spatial databases

  • Author

    Borah, B. ; Bhattacharyya, D.K.

  • Author_Institution
    Dept. of Inf. Technol., Tezpur Univ., Assam, India
  • fYear
    2004
  • fDate
    2004
  • Firstpage
    92
  • Lastpage
    96
  • Abstract
    Spatial data clustering is one of the important data mining techniques for extracting knowledge from large amount of spatial data collected in various applications, such as remote sensing, GIS, computer cartography, environmental assessment and planning, etc. Several useful and popular spatial data clustering algorithms have been proposed in the past decade. DBSCAN is one of them, which can discover clusters of any arbitrary shape and can handle the noise points effectively. However, DBSCAN requires large volume of memory support because it operates on the entire database. This paper presents an improved sampling-based DBSCAN which can cluster large-scale spatial databases effectively. Experimental results included to establish that the proposed sampling-based DBSCAN outperforms DBSCAN as well as its other counterparts, in terms of execution time, without losing the quality of clustering.
  • Keywords
    data mining; pattern clustering; sampling methods; very large databases; visual databases; data mining techniques; density based spatial clustering of application with noise; image sampling; knowledge extraction; large spatial databases; spatial data clustering algorithms; Application software; Clustering algorithms; Data mining; Geographic Information Systems; Large-scale systems; Noise shaping; Remote sensing; Shape; Spatial databases; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
  • Print_ISBN
    0-7803-8243-9
  • Type

    conf

  • DOI
    10.1109/ICISIP.2004.1287631
  • Filename
    1287631