Title :
Efficiently Mining Maximal 1-complete Regions from Dense Datasets
Author :
Bian, Haiyun ; Bhatnagar, Raj
Author_Institution :
Dept. of ECECS, Cincinnati Univ., OH
Abstract :
We propose a new search algorithm for maximal 1-complete regions in high dimensional dense binary datasets. A maximal 1-complete region indicates a potential functional cluster, and it is mathematically equivalent to a formal concept and also a closed itemset. Our algorithm is designed for dense datasets, where the percentage of 1´s in the dataset is higher than 10%, and the total number of maximal 1-complete regions is much larger than the number of objects in the dataset. Our algorithm is memory efficient and unlike other closed set mining algorithms, it does not require all patterns mined so far to be kept in the memory. We show that our algorithm has solid theoretical foundations, and it is also very time efficient compared with other existing algorithms
Keywords :
data mining; binary datasets; dense dataset mining; formal concept; maximal 1-complete regions; search algorithm; Algorithm design and analysis; Association rules; Clustering algorithms; Data mining; Frequency; Gene expression; Itemsets; Machine learning; Signal to noise ratio; Solids;
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
DOI :
10.1109/ICDMW.2006.69