• DocumentCode
    2019301
  • Title

    Mining Frequent Closed Itemsets from Distributed Dataset

  • Author

    Ju, Chunhua ; Ni, Dongjun

  • Author_Institution
    Coll. of Comput. & Inf. Eng., Zhejiang Gongshang Univ., Hangzhou
  • Volume
    1
  • fYear
    2008
  • fDate
    17-18 Oct. 2008
  • Firstpage
    37
  • Lastpage
    41
  • Abstract
    In this paper we address the problem of mining frequent closed itemsets in a highly distributed setting. The extraction of distributed frequent (close) itemsets is an important task in data mining. The paper shows how frequent closed itemsets, mined independently in each site, can be merged in order to derive globally frequent closed itemsets. Unfortunately, as distributed setting is various, it is unreasonable to adopt only one mining approach. The paper analyzes the distributed setting from three perspectives: 1 communication bandwidth; 2 site quantity; 3 the characteristic of each site dataset, and presents two mining approaches and their algorithms in their corresponding distributed setting. The experiment results indicate that our algorithms are efficient to their corresponding distributed setting.
  • Keywords
    data mining; distributed algorithms; distributed databases; data mining; distributed database; frequent closed itemset mining algorithm; Algorithm design and analysis; Association rules; Bandwidth; Computational intelligence; Data engineering; Data mining; Design engineering; Distributed computing; Educational institutions; Itemsets; distributed mining; distributed setting; frequent closed itemsets;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Design, 2008. ISCID '08. International Symposium on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3311-7
  • Type

    conf

  • DOI
    10.1109/ISCID.2008.24
  • Filename
    4725552