DocumentCode
2757478
Title
Distributed Frequent Closed Itemsets Mining
Author
Liu, Chun ; Zheng, Zheng ; Cai, Kai-Yuan ; Zhang, Shichao
Author_Institution
Sch. of Autom. Sci. & Electr. Eng., Beijing Univ. of Aeronaut. & Astronaut., Beijing
fYear
2007
fDate
16-18 Dec. 2007
Firstpage
43
Lastpage
50
Abstract
As many large organizations have multiple data sources and datasets keep growing, data mining must increasingly be carried out in distributed environments. In this paper, we address the problem of mining global frequent closed itemsets in a distributed environment. A novel algorithm is proposed that obtains global frequent closed itemsets with their exact frequencies, and it is shown that the algorithm finds all global frequent closed itemsets. A new data structure is developed to maintain the closed itemsets, and an efficient implementation based on this structure is provided. Experimental results show that the proposed algorithm is effective.
Keywords
data mining; distributed frequent closed itemsets mining; multiple data sources; Association rules; Automation; Data structures; Explosions; Frequency; Internet; Itemsets; Space technology; Transaction databases; Data streams; Frequent closed itemsets
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal-Image Technologies and Internet-Based System, 2007. SITIS '07. Third International IEEE Conference on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3122-9
Type
conf
DOI
10.1109/SITIS.2007.64
Filename
4618757