DocumentCode
2789487
Title
A Multi-Level Parallel Implementation of a Program for Finding Frequent Patterns in a Large Sparse Graph
Author
Reinhardt, Steve ; Karypis, George
Author_Institution
SGI, Eagan, MN
fYear
2007
fDate
26-30 March 2007
Firstpage
1
Lastpage
8
Abstract
Graphs capture the essential elements of many problems broadly defined as searching or categorizing. With the rapid increase of data volumes from sensors, many application disciplines need to process larger graphs quickly. This paper presents the results of parallelizing with OpenMP an algorithm that finds, in a single large labeled undirected sparse graph, the connected subgraphs with a given minimum number of edge-disjoint embeddings. Parallelism is exploited at two levels in the algorithm. The lack of a priori knowledge of the extent of parallelism for a given input required use of a dynamic, multi-level approach based on the proposed OpenMP taskq/task extensions. The parallel implementation required the addition of 21 directives and about 50 accompanying lines of code, in an original code of about 15,000 lines. Experimental results show excellent speed-up to 30 processors for the graphs used, with a best speed-up of 26.1 compared to the serial version. The taskq/task constructs show promise for problems exhibiting unstructured parallelism.
Keywords
data mining; graph theory; message passing; parallel programming; OpenMP algorithm; frequent patterns; large labeled undirected sparse graph; multilevel parallel implementation; Application software; Biochemical analysis; Chemical compounds; Computer science; Data mining; Degradation; Parallel languages; Parallel processing; Runtime library; Sensor arrays; OpenMP; data mining; frequent subgraph; parallel processing; pattern discovery; unstructured parallelism;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location
Long Beach, CA
Print_ISBN
1-4244-0910-1
Electronic_ISBN
1-4244-0910-1
Type
conf
DOI
10.1109/IPDPS.2007.370404
Filename
4228132
Link To Document