DocumentCode
2833714
Title
An improved sampling-based DBSCAN for large spatial databases
Author
Borah, B. ; Bhattacharyya, D.K.
Author_Institution
Dept. of Inf. Technol., Tezpur Univ., Assam, India
fYear
2004
fDate
2004
Firstpage
92
Lastpage
96
Abstract
Spatial data clustering is one of the important data mining techniques for extracting knowledge from large amount of spatial data collected in various applications, such as remote sensing, GIS, computer cartography, environmental assessment and planning, etc. Several useful and popular spatial data clustering algorithms have been proposed in the past decade. DBSCAN is one of them, which can discover clusters of any arbitrary shape and can handle the noise points effectively. However, DBSCAN requires large volume of memory support because it operates on the entire database. This paper presents an improved sampling-based DBSCAN which can cluster large-scale spatial databases effectively. Experimental results included to establish that the proposed sampling-based DBSCAN outperforms DBSCAN as well as its other counterparts, in terms of execution time, without losing the quality of clustering.
Keywords
data mining; pattern clustering; sampling methods; very large databases; visual databases; data mining techniques; density based spatial clustering of application with noise; image sampling; knowledge extraction; large spatial databases; spatial data clustering algorithms; Application software; Clustering algorithms; Data mining; Geographic Information Systems; Large-scale systems; Noise shaping; Remote sensing; Shape; Spatial databases; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
Print_ISBN
0-7803-8243-9
Type
conf
DOI
10.1109/ICISIP.2004.1287631
Filename
1287631
Link To Document