DocumentCode
243747
Title
Parallel Frequent Pattern Mining without Candidate Generation on GPUs
Author
Fei Wang ; Bo Yuan
Author_Institution
Div. of Inf., Tsinghua Univ., Shenzhen, China
fYear
2014
fDate
14-14 Dec. 2014
Firstpage
1046
Lastpage
1052
Abstract
The graphics processing unit (GPU) has evolved into a key part of today´s heterogeneous parallel computing architecture. A number of influential data mining algorithms have been parallelized on GPUs including frequent pattern mining algorithms, such as Apriori. Unfortunately, due to two major challenges, the more effective method for mining frequent patterns without candidate generation named FP-Growth has not been implemented on GPUs. Firstly, it is very hard to efficiently build the FP-Tree in parallel on GPUs as it is an inherently sequential process. Secondly, mining the FP-Tree in parallel is also a difficult task. In this paper, we propose a fully parallel method to build the FP-Tree on CUDA-enabled GPUs and implement a novel parallel algorithm for mining all frequent patterns using the latest CUDA Dynamic Parallelism techniques. We show that, on a range of representative benchmark datasets, the proposed GPU-based FP-Growth algorithm can achieve significant speedups compared to the original algorithm.
Keywords
data mining; graphics processing units; parallel algorithms; parallel architectures; trees (mathematics); CUDA dynamic parallelism technique; FP-tree; GPU; data mining algorithm; graphics processing unit; parallel algorithm; parallel frequent pattern mining; Algorithm design and analysis; Data mining; Graphics processing units; Heuristic algorithms; Instruction sets; Itemsets; Parallel processing; Dynamic Parallelism; FP-Growth; FP-Tree; GPU;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-4799-4275-6
Type
conf
DOI
10.1109/ICDMW.2014.71
Filename
7022712
Link To Document