Title :
GDCIC: A Grid-based Density-Confidence-Interval Clustering Algorithm for Multi-density Dataset in Large Spatial Database
Author :
Gao, Song ; Xia, Ying
Author_Institution :
Coll. of Comput. Sci. & Technol., Chongqing Univ. of Posts & Telecommun.
Abstract :
Spatial clustering, which groups similar objects based on their distance, connectivity, or their relative density in space, is an important component of spatial data mining. The problem of detecting clusters of points belonging to a spatial point process arises in many applications. One of the challenges in spatial clustering is to find clusters under various cluster number, object distribution as well as multi-density. In this paper, we propose GDCIC, a grid-based density-confidence-interval clustering algorithm for multi-density in large spatial database. By using the technique of confidence limits of the density confidence interval, accurate density estimation in local areas can be produced to form local density thresholds. Local dense areas are distinguished from sparse areas or outliers with the help of these thresholds. An optional procedure is included in GDCIC to optimize the clustering result. The experimental studies on both synthetic and real datasets show its high accuracy and performance over existing algorithms
Keywords :
data mining; grid computing; pattern clustering; visual databases; grid-based density-confidence-interval clustering algorithm; large spatial database; multidensity dataset; spatial clustering; spatial data mining; Cities and towns; Clustering algorithms; Computer science; Data mining; Educational institutions; Geographic Information Systems; Information technology; Shape; Space technology; Spatial databases;
Conference_Titel :
Intelligent Systems Design and Applications, 2006. ISDA '06. Sixth International Conference on
Conference_Location :
Jinan
Print_ISBN :
0-7695-2528-8
DOI :
10.1109/ISDA.2006.161