• DocumentCode
    3515530
  • Title

    Discovering closed frequent itemsets on multicore: Parallelizing computations and optimizing memory accesses

  • Author

    Negrevergne, Benjamin ; Termier, Alexandre ; Méhaut, Jean-François ; Uno, Takeaki

  • Author_Institution
    Lab. d´´Inf. de Grenoble, Grenoble, France
  • fYear
    2010
  • fDate
    June 28 2010-July 2 2010
  • Firstpage
    521
  • Lastpage
    528
  • Abstract
    The problem of closed frequent itemset discovery is a fundamental problem of data mining, having applications in numerous domains. It is thus very important to have efficient parallel algorithms to solve this problem, capable of efficiently harnessing the power of multicore processors that exists in our computers (notebooks as well as desktops). In this paper we present PLCMQS, a parallel algorithm based on the LCM algorithm, recognized as the most efficient algorithm for sequential discovery of closed frequent itemsets. We also present a simple yet powerfull parallelism interface based on the concept of Tuple Space, which allows an efficient dynamic sharing of the work. Thanks to a detailed experimental study, we show that PLCMQS is efficient on both on sparse and dense databases.
  • Keywords
    data mining; parallel algorithms; PLCMQS; Tuple space; closed frequent itemsets; data mining; multicore processors; optimizing memory accesses; parallel algorithms; Data mining; Itemsets; Multicore processing; Program processors; frequent closed itemset; memory accesses; multicore; pattern mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Simulation (HPCS), 2010 International Conference on
  • Conference_Location
    Caen
  • Print_ISBN
    978-1-4244-6827-0
  • Type

    conf

  • DOI
    10.1109/HPCS.2010.5547082
  • Filename
    5547082