• DocumentCode
    243747
  • Title

    Parallel Frequent Pattern Mining without Candidate Generation on GPUs

  • Author

    Fei Wang ; Bo Yuan

  • Author_Institution
    Div. of Inf., Tsinghua Univ., Shenzhen, China
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    1046
  • Lastpage
    1052
  • Abstract
    The graphics processing unit (GPU) has evolved into a key part of today´s heterogeneous parallel computing architecture. A number of influential data mining algorithms have been parallelized on GPUs including frequent pattern mining algorithms, such as Apriori. Unfortunately, due to two major challenges, the more effective method for mining frequent patterns without candidate generation named FP-Growth has not been implemented on GPUs. Firstly, it is very hard to efficiently build the FP-Tree in parallel on GPUs as it is an inherently sequential process. Secondly, mining the FP-Tree in parallel is also a difficult task. In this paper, we propose a fully parallel method to build the FP-Tree on CUDA-enabled GPUs and implement a novel parallel algorithm for mining all frequent patterns using the latest CUDA Dynamic Parallelism techniques. We show that, on a range of representative benchmark datasets, the proposed GPU-based FP-Growth algorithm can achieve significant speedups compared to the original algorithm.
  • Keywords
    data mining; graphics processing units; parallel algorithms; parallel architectures; trees (mathematics); CUDA dynamic parallelism technique; FP-tree; GPU; data mining algorithm; graphics processing unit; parallel algorithm; parallel frequent pattern mining; Algorithm design and analysis; Data mining; Graphics processing units; Heuristic algorithms; Instruction sets; Itemsets; Parallel processing; Dynamic Parallelism; FP-Growth; FP-Tree; GPU;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.71
  • Filename
    7022712