DocumentCode :
243752
Title :
HDminer: Efficient Mining of High Dimensional Frequent Closed Patterns from Dense Data
Author :
Jianpeng Xu ; Shufan Ji
Author_Institution :
Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
1061
Lastpage :
1067
Abstract :
Frequent closed pattern mining has been developed for decades, mostly on a two dimensional matrix. This paper addresses the problem of mining high dimensional frequent closed patterns (nFCPs) from dense binary dataset, where the dataset is represented by a high dimensional cube. As existing FP-tree or enumeration tree based algorithms do not suit for n-dimensional dense data, we are motivated to propose a novel algorithm called HDminer for nFCPs mining. HDminer employs effective search space partition and pruning strategies to enhance the mining efficiency. We have implemented HDminer, and the performance studies on synthetic data and real microarray data show its superiority over existing algorithms.
Keywords :
data mining; search problems; trees (mathematics); FP-tree; HDminer; dense binary dataset; enumeration tree based algorithm; frequent closed pattern mining; high dimensional cube; high dimensional frequent closed pattern; mining efficiency; n-dimensional dense data; nFCP mining; pruning strategy; real microarray data; search space partition; synthetic data; two dimensional matrix; Biology; Data mining; Data processing; Educational institutions; Out of order; Partitioning algorithms; Three-dimensional displays; Frequent Pattern Mining; HDminer; High Dimensional Data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.59
Filename :
7022714
Link To Document :
بازگشت