DocumentCode :
2037929
Title :
Parallel hierarchical clustering on shared memory platforms
Author :
Hendrix, William ; Ali Patwary, Md Mostofa ; Agrawal, Ankit ; Wei-keng Liao ; Choudhary, Alok
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci, Northwestern Univ. Evanston, Evanston, IL, USA
fYear :
2012
fDate :
18-22 Dec. 2012
Firstpage :
1
Lastpage :
9
Abstract :
Hierarchical clustering has many advantages over traditional clustering algorithms like k-means, but it suffers from higher computational costs and a less obvious parallel structure. Thus, in order to scale this technique up to larger datasets, we present SHRINK, a novel shared-memory algorithm for single-linkage hierarchical clustering based on merging the solutions from overlapping sub-problems. In our experiments, we find that SHRINK provides a speedup of 18-20 on 36 cores on both real and synthetic datasets of up to 250,000 points. Source code for SHRINK is available for download on our website, http://cucis.ece.northwestern.edu.
Keywords :
pattern clustering; shared memory systems; SHRINK; k-means; overlapping subproblems; parallel hierarchical clustering; shared memory platforms; single-linkage hierarchical clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing (HiPC), 2012 19th International Conference on
Conference_Location :
Pune
Print_ISBN :
978-1-4673-2372-7
Electronic_ISBN :
978-1-4673-2370-3
Type :
conf
DOI :
10.1109/HiPC.2012.6507511
Filename :
6507511
Link To Document :
بازگشت