DocumentCode :
2019301
Title :
Mining Frequent Closed Itemsets from Distributed Dataset
Author :
Ju, Chunhua ; Ni, Dongjun
Author_Institution :
Coll. of Comput. & Inf. Eng., Zhejiang Gongshang Univ., Hangzhou
Volume :
1
fYear :
2008
fDate :
17-18 Oct. 2008
Firstpage :
37
Lastpage :
41
Abstract :
In this paper we address the problem of mining frequent closed itemsets in a highly distributed setting. The extraction of distributed frequent (close) itemsets is an important task in data mining. The paper shows how frequent closed itemsets, mined independently in each site, can be merged in order to derive globally frequent closed itemsets. Unfortunately, as distributed setting is various, it is unreasonable to adopt only one mining approach. The paper analyzes the distributed setting from three perspectives: 1 communication bandwidth; 2 site quantity; 3 the characteristic of each site dataset, and presents two mining approaches and their algorithms in their corresponding distributed setting. The experiment results indicate that our algorithms are efficient to their corresponding distributed setting.
Keywords :
data mining; distributed algorithms; distributed databases; data mining; distributed database; frequent closed itemset mining algorithm; Algorithm design and analysis; Association rules; Bandwidth; Computational intelligence; Data engineering; Data mining; Design engineering; Distributed computing; Educational institutions; Itemsets; distributed mining; distributed setting; frequent closed itemsets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Design, 2008. ISCID '08. International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3311-7
Type :
conf
DOI :
10.1109/ISCID.2008.24
Filename :
4725552
Link To Document :
بازگشت