DocumentCode :
1938411
Title :
A Multi-Relational Hierarchical Clustering Algorithm Based on Shared Nearest Neighbor Similarity
Author :
Guo, Jing-Feng ; Zhao, Yu-Yan ; Li, Jing
Volume :
7
fYear :
2007
fDate :
19-22 Aug. 2007
Firstpage :
3951
Lastpage :
3955
Abstract :
The clustering about relational databases is an active study subject in data mining. In this paper, we introduce a multi-relational hierarchical clustering algorithm based on shared nearest neighbor similarity (MHSNNS). First, this algorithm joins every table through the tuple 1D propagation. Then, groups objects into a large number of relatively small sub-clusters using the shared nearest neighbor algorithm and the cluster cohesion. Last, find the genuine clusters by repeatedly combining these sub-clusters using the cluster separation. The experiment shows the efficiency and scalability of this approach.
Keywords :
data mining; hierarchical systems; pattern clustering; relational databases; cluster separation; data mining; multirelational hierarchical clustering; relational databases; shared nearest neighbor similarity; Clustering algorithms; Cybernetics; Data mining; Educational institutions; Machine learning; Machine learning algorithms; Nearest neighbor searches; Partitioning algorithms; Relational databases; Scalability; Data mining; Hierarchical clustering; Multi-relational clustering; Relational databases; Shared nearest neighbor;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
Type :
conf
DOI :
10.1109/ICMLC.2007.4370836
Filename :
4370836
Link To Document :
بازگشت