• DocumentCode
    243752
  • Title

    HDminer: Efficient Mining of High Dimensional Frequent Closed Patterns from Dense Data

  • Author

    Jianpeng Xu ; Shufan Ji

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    1061
  • Lastpage
    1067
  • Abstract
    Frequent closed pattern mining has been developed for decades, mostly on a two dimensional matrix. This paper addresses the problem of mining high dimensional frequent closed patterns (nFCPs) from dense binary dataset, where the dataset is represented by a high dimensional cube. As existing FP-tree or enumeration tree based algorithms do not suit for n-dimensional dense data, we are motivated to propose a novel algorithm called HDminer for nFCPs mining. HDminer employs effective search space partition and pruning strategies to enhance the mining efficiency. We have implemented HDminer, and the performance studies on synthetic data and real microarray data show its superiority over existing algorithms.
  • Keywords
    data mining; search problems; trees (mathematics); FP-tree; HDminer; dense binary dataset; enumeration tree based algorithm; frequent closed pattern mining; high dimensional cube; high dimensional frequent closed pattern; mining efficiency; n-dimensional dense data; nFCP mining; pruning strategy; real microarray data; search space partition; synthetic data; two dimensional matrix; Biology; Data mining; Data processing; Educational institutions; Out of order; Partitioning algorithms; Three-dimensional displays; Frequent Pattern Mining; HDminer; High Dimensional Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.59
  • Filename
    7022714