Title :
Distributed Frequent Closed Itemsets Mining
Author :
Liu, Chun ; Zheng, Zheng ; Cai, Kai-Yuan ; Zhang, Shichao
Author_Institution :
Sch. of Autom. Sci. & Electr. Eng., Beijing Univ. of Aeronaut. & Astronaut., Beijing
Abstract :
Many large organizations maintain multiple data sources, and datasets keep growing, so data mining increasingly has to be carried out in a distributed environment. In this paper, we address the problem of mining global frequent closed itemsets in a distributed environment. We propose a novel algorithm that obtains the global frequent closed itemsets with their exact frequencies, and we show that it finds all of them. A new data structure is developed to maintain the closed itemsets, and an efficient implementation based on this structure is provided. Experimental results show that the proposed algorithm is effective.
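To make the terms in the abstract concrete, the following is a minimal brute-force sketch of what "global frequent closed itemsets with exact frequency" means. It is not the paper's algorithm or data structure, neither of which is described in this record. The function names `local_counts` and `global_frequent_closed`, the `sites` layout (one list of transactions per data source), and the absolute `min_support` threshold are all assumptions made for illustration. An itemset is treated as closed when no proper superset has the same global support.

```python
from collections import Counter
from itertools import combinations


def local_counts(transactions):
    """Count the support of every itemset that occurs at one site (hypothetical helper)."""
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))
        # Enumerate every non-empty subset; exponential, so toy data only.
        for size in range(1, len(items) + 1):
            for subset in combinations(items, size):
                counts[frozenset(subset)] += 1
    return counts


def global_frequent_closed(sites, min_support):
    """Sum per-site counts, then keep itemsets that are frequent and closed."""
    total = Counter()
    for transactions in sites:
        total.update(local_counts(transactions))  # exact global support

    frequent = {s: c for s, c in total.items() if c >= min_support}

    closed = {}
    for itemset, support in frequent.items():
        # Closed: no proper superset shares the same global support.
        has_equal_superset = any(
            itemset < other and support == other_support
            for other, other_support in frequent.items()
        )
        if not has_equal_superset:
            closed[itemset] = support
    return closed


# Two hypothetical data sources, two transactions each.
sites = [
    [{"a", "b", "c"}, {"a", "b"}],
    [{"a", "c"}, {"a", "b", "c"}],
]
result = global_frequent_closed(sites, min_support=2)
```

Here `{b}` has global support 3 but is not closed, because `{a, b}` also has support 3. Exchanging the full subset counts between sites, as this sketch does, is exactly the cost a practical distributed algorithm would try to avoid.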
Keywords :
data mining; distributed frequent closed itemsets mining; multiple data sources; Association rules; Automation; Data structures; Explosions; Frequency; Internet; Itemsets; Space technology; Transaction databases; Data streams; Frequent closed itemsets
Conference_Title :
Signal-Image Technologies and Internet-Based System, 2007. SITIS '07. Third International IEEE Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3122-9
DOI :
10.1109/SITIS.2007.64