DocumentCode :
3680234
Title :
HDenDist: Nonlinear Hierarchical Clustering Based on Density and Min-distance
Author :
Wen-Qi Fan; Chang-Dong Wang; Yuan-Wei Chen; Jian-Huang Lai
Author_Institution :
Sch. of Mobile Inf. Eng., Sun Yat-sen Univ., Zhuhai, China
fYear :
2015
Firstpage :
45
Lastpage :
50
Abstract :
Hierarchical clustering has received a great amount of attention due to the capability of capturing hierarchical cluster structure in an unsupervised way. Despite great success, most of the existing hierarchical clustering algorithms have some drawbacks: (1) difficulty in selecting clusters to merge or split, (2) inefficient and inaccurate cluster validation, (3) limitation to only linearly separable clusters. To address the above issues, this paper proposes a new nonlinear hierarchical clustering method termed HDenDist. The proposed method is based on two observations/designs associated with density and min-distance. One is that cluster centers have a higher density and are surrounded by data points of lower density, and the distance between cluster centers is relatively long, the other is that we design a min-distance between nodes, which can be used to determine how to divide the nodes in the hierarchical tree into two sub-cluster nodes. Some dividing and ruling tricks are designed that can further reduce the sensitivity to parameters. What´s more, the density and distance are combined to determine when to terminate the split of the cluster nodes. In experimental studies, the proposed method has shown promising results on real datasets.
Keywords :
"Clustering algorithms","Clustering methods","Big data","Algorithm design and analysis","Yttrium","Density measurement","Spirals"
Publisher :
ieee
Conference_Titel :
Big Data and Cloud Computing (BDCloud), 2015 IEEE Fifth International Conference on
Type :
conf
DOI :
10.1109/BDCloud.2015.16
Filename :
7310714
Link To Document :
بازگشت