DocumentCode
243752
Title
HDminer: Efficient Mining of High Dimensional Frequent Closed Patterns from Dense Data
Author
Jianpeng Xu ; Shufan Ji
Author_Institution
Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
fYear
2014
fDate
14-14 Dec. 2014
Firstpage
1061
Lastpage
1067
Abstract
Frequent closed pattern mining has been developed for decades, mostly on a two dimensional matrix. This paper addresses the problem of mining high dimensional frequent closed patterns (nFCPs) from dense binary dataset, where the dataset is represented by a high dimensional cube. As existing FP-tree or enumeration tree based algorithms do not suit for n-dimensional dense data, we are motivated to propose a novel algorithm called HDminer for nFCPs mining. HDminer employs effective search space partition and pruning strategies to enhance the mining efficiency. We have implemented HDminer, and the performance studies on synthetic data and real microarray data show its superiority over existing algorithms.
Keywords
data mining; search problems; trees (mathematics); FP-tree; HDminer; dense binary dataset; enumeration tree based algorithm; frequent closed pattern mining; high dimensional cube; high dimensional frequent closed pattern; mining efficiency; n-dimensional dense data; nFCP mining; pruning strategy; real microarray data; search space partition; synthetic data; two dimensional matrix; Biology; Data mining; Data processing; Educational institutions; Out of order; Partitioning algorithms; Three-dimensional displays; Frequent Pattern Mining; HDminer; High Dimensional Data;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-4799-4275-6
Type
conf
DOI
10.1109/ICDMW.2014.59
Filename
7022714
Link To Document