Title :
DBSCAN: Past, present and future
Author :
Khan, K. ; Rehman, Saif Ur ; Aziz, Khurram ; Fong, Simon ; Sarasvady, S.
Author_Institution :
Dept. of Comput. Sci., SZABIST, Islamabad, Pakistan
Abstract :
Data Mining is all about data analysis techniques. It is useful for extracting hidden and interesting patterns from large datasets. Clustering techniques are important when it comes to extracting knowledge from large amount of spatial data collected from various applications including GIS, satellite images, X-ray crystallography, remote sensing and environmental assessment and planning etc. To extract useful pattern from these complex data sources several popular spatial data clustering techniques have been proposed. DBSCAN (Density Based Spatial Clustering of Applications with Noise) is a pioneer density based algorithm. It can discover clusters of any arbitrary shape and size in databases containing even noise and outliers. DBSCAN however are known to have a number of problems such as: (a) it requires user´s input to specify parameter values for executing the algorithm; (b) it is prone to dilemma in deciding meaningful clusters from datasets with varying densities; (c) and it incurs certain computational complexity. Many researchers attempted to enhance the basic DBSCAN algorithm, in order to overcome these drawbacks, such as VDBSCAN, FDBSCAN, DD_DBSCAN, and IDBSCAN. In this study, we survey over different variations of DBSCAN algorithms that were proposed so far. These variations are critically evaluated and their limitations are also listed.
Keywords :
data analysis; data mining; knowledge acquisition; pattern clustering; DBSCAN; DD_DBSCAN; FDBSCAN; IDBSCAN; VDBSCAN; complex data sources; computational complexity; data analysis technique; data mining; density based spatial clustering of applications with noise; knowledge extraction; pioneer density based algorithm; spatial data clustering technique; spatial data collection; Algorithm design and analysis; Clustering algorithms; Data mining; Noise; Partitioning algorithms; Shape; Spatial databases; Clustering; DBSCAN; data mining algorithms; density; sampling; spatial data;
Conference_Titel :
Applications of Digital Information and Web Technologies (ICADIWT), 2014 Fifth International Conference on the
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-2258-1
DOI :
10.1109/ICADIWT.2014.6814687