DocumentCode :
561186
Title :
On a Distributed Approach for Density-Based Clustering
Author :
Khac, Nhien An Le ; Kechadi, M-Tahar
Author_Institution :
Sch. of Comput. Sci., Univ. Coll. Dublin, Dublin, Ireland
Volume :
1
fYear :
2011
fDate :
18-21 Dec. 2011
Firstpage :
283
Lastpage :
286
Abstract :
Efficient extraction of useful knowledge from very large datasets is still a challenge, mainly when the datasets are distributed, heterogeneous and of different quality depending of the various nodes involved. To reduce the overhead cost due to communications, most of the existing distributed clustering approaches generates global models by aggregating local results obtained on each individual node. The complexity and quality of solutions depend highly on the quality of the aggregation. In this respect, we propose distributed density-based clustering that both reduces the communication overheads and improves the quality of the global models by considering the shapes of local clusters. From preliminary results we show that this algorithm is very promising.
Keywords :
computational complexity; knowledge acquisition; pattern clustering; datasets; distributed density based clustering; knowledge extraction; overhead cost reduction; solution complexity; solution quality; Algorithm design and analysis; Bismuth; Clustering algorithms; Data mining; Merging; Shape; Vectors; balance vector; clustering; distributed data mining; distributed platform; large datasets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4577-2134-2
Type :
conf
DOI :
10.1109/ICMLA.2011.108
Filename :
6146985
Link To Document :
بازگشت