DocumentCode
3758701
Title
A novel algorithm DBCAPSIC for clustering non-numeric data
Author
Jinkun Geng;Daren Ye;Ping Luo
Author_Institution
School of Software, Beihang University, Beijing 100191, China
fYear
2015
Firstpage
295
Lastpage
304
Abstract
Data mining techniques are playing an important role in the analysis of mass network information and big data nowadays. The cluster analysis, as a main kind of method in data mining, draws great interest from researchers of various fields who proposed many algorithms such as k-means algorithm and its variants, density-based algorithm and its variants. However, these algorithms all have their own problems. This paper focuses on some of the problems and proposes a novel algorithm DBCAPSIC. The algorithm overcomes the k-means algorithm´s sensitivity to initial conditions and avoids common density-based algorithms´ "clustering failure" in some cases. Also, the algorithm has the linear time complexity of O{n), compared to the quadratic time complexity of common density-based clustering algorithms.
Keywords
"Clustering algorithms","Algorithm design and analysis","Time complexity","Sensitivity","Partitioning algorithms","Cost function","Data mining"
Publisher
ieee
Conference_Titel
Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2015 IEEE
Print_ISBN
978-1-4799-1979-6
Type
conf
DOI
10.1109/IAEAC.2015.7428564
Filename
7428564
Link To Document